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WO«l/82f63 PCTAISOl/13806 
EPITOPE SYNCHRONEAIION IN ANTIGEN PKESENTING CELLS 

Background of the Invention 

Field of the Invention 

The invention disclosed herein relates to methods and compositions for inducing an 
antigen presenting ceU to present a particular target cell-specific epitope, thereby promoting an 
effective ototoxic T cell response to die target cell. 

The invention further relates to the identification of target cell epitopes and epitope 
chisters. and also to epitope-encoding vectors, that can be used to generate immunologically active 
pharmaceutical compositions. These compositions, when administered, can stimulate the immune 
system of a subject to mount an immune response against a target cell displaying the target antigen. 
The invention is therefore useful in the treatment and prevention of neoplastic and viral disease. 
Description of the Related Art 

Neoplasia and the Immune System 

The neoplastic disease state commonly known as cancer is thought to generally result from 
a single cell growing out of control. The uncontroUed growth state typically results from a multi- 
step process in which a series of cellular systems fail, resulting in the genesis of a neoplastic cell. 
Hie resulting neoplastic cell rapidly reproduces itself, fonns one or more tumors, and eventually 
m^ cause die death of tiie host. 

Because the progenitor of the neoplastic ceU shares tiie host's genetic material, neoplastic 
cells are largely exempt from the host's immune system. During immune surveillance, the process 
in which the host's immune system surveys and localizes foreign materials, a neoplastic cell will 
appear to liie host's immune surveillance machinery as a "self cell. 

Viruses and the Ttnmime System 

In contrast to cancer cells, virus infection involves the expression of clearly non-self 
antigens. As a result, many virus infections are successfully dealt with by the immune system wilh 
minimal clmical sequela. Moreover, it has been possible to develop effective vaccines for many of 
those mfections that do cause serious disease. A variety of vaccine approaches have been 
successfully used to combat various diseases. These approaches include subunit vaccines 
consisting of individual proteins produced through recombinant DNA technology. 
Notwithstanding these advances, the selection and effective administration of minimal epitopes for 
xise as viral vaccines has remained problematic. 

In addition to the difBculties involved in epitope selection stands tiie problem of viruses 
flmt have evolved the capability of evading a host's immune system. Many viruses, especially 
viruses that estabUsh persistent infections, such as members of the heipes and pox vims families. 
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produce immunomodulatory molecules that pennit the vims to evade Ae host's immune system. 
The effects of these immunomodulatory molecules on antigen presentation may be overcome by 
the targeting of select epitopes for administration as immunogenic compositions. To better 
understand the interaction of neoplastic cells and virally mfected cells with the host's immune 
5 system, a discussion of the system's components follows below. 

The immune system functions to discriminate molecules endogenous to an organism 
C'self molecules) fix>m material exogenous or foreign to tiie organism ^non-self' molecules). 
The immune system has two types of adaptive responses to foreign bodies based on the components 
that mediate tiie response: a humoral response and a cell-mediatedresponse. The humotal response is 

10 mediated by antibodies^ while the cell-mediated response involves cells classified as tjonphocytes. 
Recent anticancer and antivkal strategies have focused on mobilizing the host immune system as a 
means of anticanceror antiviral treatment or ther^y. 

The immune system functions in three phases to protect the host from foreign bodies: tbe 
cognitive phase, Ihe activation phase, and the effector phase. In the cognitive phase, tiie immune 

1 S system recognizes and signals tiie presence of a foreign antigen or invader in the body. The foreign 
antigen can be, for example, a cell surface marker from a neoplastic cell or a viral protein. Once the 
system is aware of an invading body, antigen specific cells of the immune system proliferate and 
differentiate in response to the invader-triggeredsignals. The last stage is the effector stage in which 
the effector cells of the immune system respond to and neutralize the detected invader. 

20 Anarrayofeffectorcellsimplementanimmuneresponsetoaninvader. One type of effector 

cell, the B cell, generates antibodies targeted against foreign antigens encountered by the host In 
combination with the complement system, antibodies direct the destruction of cells or organisms 
bearing the targeted antigen. Another type of effector cell is the natural killer cell (NK cell), a type of 
lymphocyte having the capacity to spontaneously recognize and destroy a variety of virus infected 

25 cells as well as malignant cell types. The method used by NK cells to recognize target cells is poorly 
understood. 

Another type of effector cell, tibe T cell, has members classified into three subcategories, each 
playing a different role in the hnmune response. Helper T cells secrete cytokines which stimulate the 
proliferation of other cells necessary for mounting an effective immune response, while suppressor T 
30 cells down-regulate the immune response. A thkd category of T cell, the cytotoxic T cell (CTL), is 
capable of directly lysing a targeted cell presenting a foreign antigen on its sur&ce. 
The Major Histocompatibilitv Complex and T Cell Target Recognition 
T cells are antigen specific iromune cells that function m response to specific antigen signals. 
B lymphocytes and the antibodies tiiey produce are also antigen specific entities. However, unlike B 
35 lymphocytes, T cells do not respond to antigens in a free or sohible form. For a T cell to respond to 
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an antigen, it requires Ihe antigen to be bound to a presenting complex known as the major 
histocompatibility complex (MHC). 

MHC complex proteins provide the means by which T cells differentiatenative or "self cells 
from foreign cells. There ate two types of MHC, class I MHC and class H MHC. T Helper cells 
(cm*) predominately interact with class H MHC proteins while cytolytic T cells (CDS') 
predominately interact with class I MHC prolems. Both MHC complexes are transmembrane 
proteins wift a majority of their structure on the external surface of the cell. Additionally, both 
classes of MHC have a peptide binding cleft on their external portions. It is m this cleft that smaU 
fragments of protein^ native or foreign, are bound and presented to the extracellular environment 

Cells caUed antigen presenting cells (APCs) display antigens to T cells using the MHC 
complexes. For T cells to recognize an antigen, it must be presented on the MHC complex for 
recognition. This requirement is called MHC restriction and it is the mechanism by which T cells 
differentiate "self from "non-self cells. If an antigen is not displayed by a recognizable MHC 
complex, tiie T cell will not recognize and act on the antigen signal. T cells specific for the peptide 
15 bound to a recognizable MHC complex bmd to these MHC-peptide complexes and proceed to the 
next stages of ^e immune response. 

As discussed above, neoplastic cells are largely ignored by the immune system. A great deal 

of effortis now being «q)endedm an attemptto harness a host's immune system to aid in combating 
the presence of neoplastic celb in a host One such area of research involves the fonnulation of 
20 anticancer vaccines. 

Anticancer Vaccines 

Among the various Weapons available to an oncologist in the battle against cancer is the 
immune Systran of the patient Work has been done in various attempts to cause the immune 
system to combat cancer or neoplastic diseases. Unfortunately, the results to date have been 
25 largely disappointing. One area of particular interest involves the generation and use of anticancer 
vaccines. 

To generate a vaccine or other immunogenic composition, it is necessary to mtroduce to a 
subject an antigrai or epitope agamst which an unmune response may be mounted. Although 
neoplastic cells are derived from and therefore are substantially identical to normal cells on a 
30 genetic level, many neoplastic cells are known to present tumor-associated antigens (TuAAs). In 
theory, these antigens could be used by a subject's unmune system to recognize these antigens and 
attack the neoplastic cells. Unfortunately, neoplastic cells appear to be ignored by the host's 
immune system. 

A number of different strategies have been developed in an attempt to generate vaccines 
35 wife activity against neoplastic ceUs. These strategies include the use of tumor associated antigens 
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as iimnunogens. For example, U.S. Patent No. 5,993,828. describes a method for producing an 
inumme response against a particular subunit of the Urinary Tumor Associated Antigen by 
administering to a subject an effective dose of a composition comprising inactivated tumor cells 
having the Urinary Tumor Associated Antigen on the ceU surface and at least one tmnor associated 
antigen selected from the group consisting of GM-2, GD-2, Fetal Antigen and Melanoma 
Associated Antigen. Accordmgly, this patent describes using whole, inactivated tumor cells as the 
immunogen m an anticancer vaccine. 

Another strategy used with anticancer vaccines involves administering a composition 
containing isolated tumor antigens. In one approach, MAGE-Al antigenic peptides were used as 
an unmunogen. (See Chaux, P., et al., "Identification of Five MAGE-Al Epitopes Recognized by 
Cytolytic T Lymphocytes Obtained by In Vitro Stimulation with Dendritic Cells Transduced with 
MAGE-Al," J. Immunol., I63(5):2928-2936 (1999)). There have been several therapeutic trials 
usmg MAGE-Al peptides for vaccination, although the effectiveness of the vaccination regimes 
was limited. The results of some of these trials are discussed in Vose, J.M., 'Tumor Antigens 
Recognized by T Lymphocytes," 10* European Cancer Conference, Day 2, Sept. 14, 1999. 

In anoflier example of tumor associated antigens used as vaccines, Scheinberg, et al. 
treated 12 chronic myelogenous leukemia (CML) patients already receiving interferon (IFN) or 
hydroxyurea with 5 injections of class I-associated bcr-abl peptides with a helper peptide plus the 
adjuvant QS.21. Schemberg, D.A., et al, "BCR-ABL Breakpoint Derived Oncogene Fusion 
Peptide Vaccmes Generate Specific Immune Responses in Patients with Chronic Myelogenous 
Leukemia (CML) [Abstract 1665], American Society of Clinical Oncology 35* Annual Meeting, 
Atlanta (1999). Proliferative and delayed type hypersensitivity (DTH) T cell responses indicative 
of T-helper activity were eUcited, but no cytolytic kiUer T cell activity was observed within the 
fresh blood samples. 

Additional examples of attempts to identify TAAs for use as vaccines are seen in the recent 
work of Cebon. et ed. and Scheibenbogen, et al. Cebon et al Immunized patients with metastatic 
melanoma using intradermallly administered MART-1,„, peptide with IL-12 in increasing doses 
given either subcutaneously or intravenously. Of the first 15 patients, 1 complete remission, 1 
partial remission, and 1 mixed response were noted. Immune assays for T ceU generation included 
DlH, which was seen in patients with or without IL-12. Positive CTL assays were seen in patients 
with evidence of clinical benefit, but not in patients without tumor regression. Cebon, et al, 
"Phase I Studies of Immunization with Melan-A and n>12 in HLA A2+ Positive Patients with 
Stage m and IV Malignant Melanoma," [Abstract 1671], American Society of Clmical Oncology 
35* Annual Meeting, Atlanta (1999). 
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Scheibenbogen. et al. immunized 18 patients with 4 HLA class I restricted tyrosinase 
peptides, 16 with metastatic melanoma and 2 adjuvant patients. Scheibenbogen, et al, 
"Vaccination with Tyrosinase peptides and GM-CSF in Metastatic Melanoma: a Phase H Trial," 
[Abstract 1680], American Society of Clinical Oncology 35* Annual Meeting, Atlanta (1999). 
Increased CTL activity was observed in 4/15 patients, 2 adjuvant patients, and 2 patients with 
evidence of tumor regression. As in the trial by Cebon et al, patients with progressive disease did 
not show boosted immunity. In spite of the various efforts expended to date to generate efiScacious 
anticancer vaccines, no such composition has yet been developed. 

Vaccine strategies to protect against viral diseases have had many successes. Periiaps the 
most notable of these is the progress that has been made against the disease small pox, which has 
beeai driven to extinction. The success of the polio vaccine is of a similar magnitude. 

Viral vaccmes can be grouped into three classifications: live attenuated virus vaccines, 
such as vaccinia for smaU pox, the Sabin poliovirus vaccine, and measles mumps and rubella; 
whole kUIed or inactivated virus vaccines, such as die Salk poliovirus vaccine, hepatitis A virus 
vaccine and the typical mfluenza virus vaccines; and subunit vaccines, such as hepatitis B. Due to 
thefr lack of a complete viral genome, subunit vaccines offer a greater degree of safety than those 
based on whole viruses. 

The paradigm of a successful subunit vaccine is the recombinant hepatitis B vaccine based 
on the viruses envelope protein. Despite much academic interest in pushing tiie subunit concept 
beyond smgle protems to individual epitopes tiie efforts have yet to bear much fruit Viral vaccine 
research has also concentrated on the induction of an antibody response altiiough ceUular responses 
also occur. However, many of the subunit formulations are particularly poor at generating a CTL 
response. 

Summary of the Invention 

The present invention is du-ected to metiiods and compositions for inducing an antigen 
presentmg cell to present a particular target ceU-specific epitope, thereby promoting a prolonged, 
du-ected cytotoxic T cell response to tiie target cell. 
Vaccines Comprising Housekeeping Epitopes 

In one aspect of the invention, there is provided a vaccine mcluding a housekeeping 
epitope derived from an antigen associated witii a target cell. Advantageously, tiie target cell may 
be a neoplastic cell. The neoplastic cell can be any transformed cell associated with solid tumors 
or lymphomas such as leukemia, carcinoma, lymphoma, astrocytoma, sarcoma, glioma, 
retinoblastoma, melanoma, Wihn's tmnor, bladder cancer, breast cancer, colon cancer, 
hepatocellular cancer, pancreatic cancer, prostate cancer, lung cancer, liver cancer, stomach cancer, 
cervical cancer, testicular cancer, renal ceU cancer, and brain cancer. Alternatively, the target cell 
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can be infected by an intracellular parasite. For example, the intracellular parasite may be a virus 
such as an adenovirus, cytomegalovirus, Epstein-Barr virus, herpes simplex virus, human 
herpesvirus 6, varicella-zoster virus, hepatitis vmises, papiUoma virus, parvovirus, polyomavuus, 
measles virus, rubella vbms, human immunodeficiency virus (HIV), or human Tcell leukemia 
virus. The intracellular parasite may be a bacterium, protozoan, fungus, or a prion. More 
particularly, the intraceUular parasite can be Chlamydia, Listeria, Salmonella, Legionella, 
Brucella, Caxiella, Xickettsia, Mycobacterium, Leishmania, TYypanasoma, Toxoplasma, and 
Plasmodium. 

The housekeeping epitope can be derived from an antigen associated with the target cell. 
The antigen can be MelanA (MART-I), gplOO (Pmel 17), tyrosinase, TRP-I, TRP-2, MAGE-1, 
MAGE-3, BAGE, GAGE-1, GAGE-2, CEA, RAGE, NY-ESO, SCP-1, HomyMel-40, FRAME, 
p53, H-Ras, HER-2/neu, BCR-ABL, E2A-PRL, H4-RET, IGH-IGK, MYL-RAR, Epstein Barr 
virus antigens, EBNA, human papUlomavirus (HPV) antigens E6 and E7, TSP-180, MAGE-4, 
MAGE-5, MAGE-6, pl85erbB2, pl80erbB-3, c-met, nm-23Hl, PSA, TAG-72-4, CAM 17.1, 
15 NuMa, K-ras. P-Catenin. CDK4, Mum-1, pl6, TAGE, PSMA, PSCA, CT7, telomerase, 43-9F, 
5T4, 791Tgp72, alpha-fetoprotein , 6-HCG, BCA225, BTAA, CA 125, CA 15-3 (CA 
27.29VBCAA), CA 195, CA 242, CA-50, CAM43, CD68\KP1, CO-029, FGF-5, G250, Ga733 
(EpCAM), HTgp-175, M344, MA-50, MG7-Ag, MOV18, NB/70K, NY-CO-1, RCASl, 
SDCCAG16, TA-90 (Mac-2 binding protein\cycIophilin C-associated protein), TAAL6, TAG72, 
TLP, TPS, GA733-2\KSA and the like. Optionally, die antigen can be a virus-associated antigen. 
In anoAer aspect of the invention, the antigen can be a parasite-associated antigen. 

In another aspect of the invention, the housekeeping epitope may include or encode a 
polypeptide of about 6 to about 23 ammo acids in length. Preferably, the polypeptide is 9 or 10 
amino acids in lengfli. The polypeptide may be a synthetic polypeptide. Advantageously, the 
25 vaccine additionally includes buffers, detergents, surfactants, anti-oxidants, or reducing agents. In 
yet another aspect of the vaccine, the housekeeping epitope includes a nucleic acid. In a preferred 
ranbodunent, ihe housekeeping epitope is specific for at least one allele of MHC. The allele can 
encode types Al, A2, A3, All, A24, A26, A29, B7, B8, B14, B18, B27, B35, B44, B62, B60, or 
B51. 

30 In yet another aspect of the present invention, the vaccme may include an immune epitope. 

Optionally, the immune epitope is derived from a second antigen associated with the target cell. 
The first antigen and the second antigen may be the same or different. Advantageously, the 
housekeeping epitope is specific for a first allele of MHC, and the immune epitope is specific for a 
second allele of MHC. The first allele and second allele may be the same or different. 
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In stai anoliier aspect of the invention, flie vaccine includes an epitope cluster (see below) 
that includes immune epitope. He epitope cluster can be derived fiom a second antigen 
associated with the target ceU. The first antigen and die second antigen may be the same or 
dififerent, Advanta^ously, Ae epitope cluster includes or encodes a polypeptide having a length of 
at least 10 ammo acids but less tiian about 60 amino acids. Preferably, the length of the 
polypeptide of the epitope cluster is less than about 80%, 50%, or 20% of the length of the second 
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In anoflier aspect of the invention, the vaccine fiuther mcludes a second housekeeping 
epitope derived &om a second antigen associated with a second target cell. Optionally, the first 
antigM and the second antigen can be tiie same. Alternatively, the first and second antigen are 
different Similarly, the first and second target cell may be the same or different 

The vaccme of the present mventiim may advantageously include a nucleic acid construct 
fliat encodes a housekeeping epitope derived fix)m an antigen associated with a target cell. 
Prrferably, the target cell is a neoplastic cell. The neoplastic cell can be any transformed cell 
15 associated with soM tumors or lymphomas such as leukemia, carcinoma, lymphoma, astrocytoma, 
sarcoma, glioma, retinoblastoma, melanoma, Wihn's tumor, bladder cancer, breast cancer, colon 
cancer, hepatocellular cancer, pancreatic cancer, prostate cancer, lung cancer, liver cancer, stomach 
cancer, cervical cancer, testicular cancer, renal cell cancer, and bram cancer. In contrast, the target 
cell can be a cell infected by an mtracellular parasite. The intracellular parasite may be a virus. In 
20 particular, the virus may be an adenovirus, cytomegalovirus, Epstein-Barr virus, herpes simplex 
virus 1, herpes sunplec virus 2, human herpesvmis 6, varicella-zoster virus, hepatitis B virus, 
hepatitis D virus, papaioma virus, parvovirus B19, polyomavirus BK, polyomavirus JC, hepatitis C 
virus, measles virus, rubeUa virus, human immunodeficiency virus (HIV), human T cell leukemia 
virus I, or human T cell leukemia virus n. Optionally, the intracellular parasite is a bacterium, 
25 protozoan, fungus, or prion. More particularly, the intraceUular parasite can be Chlamydia, 
Listeria, Salmonella, Legionella, Brucella, Coxiella, Rickettsia, Mycobacterium, Leishmcmia, 
Dypanasoma, Toxoplasma, and Plasmodium. 

The antigen of the vaccine including a nucleic acid construct may be MelanA (MART-I), 
gplOO (Pmel 17), tyrosinase, TRP-1, TRP-2, MAGE-1, MAGE-3, BAGE, GAGE-1, GAGE.2, 
30 CEA, RAGE, NY-ESO, SCP-1, HomMel-40, FRAME, p53, H-Ras, HER-2/neu, BCR-ABL, E2A- 
PRL, H4-RET, IGH-IGK, MYL-RAR, Epstein Barr virus antigens, EBNA, human papiUomavirus 
(HPV) antigens E6 and E7, TSP-180, MAGE-4, MAGE-5, MAGE-6, pl85erbB2, pl80erbB-3, c- 
met. nm.23Hl, PSA, TAG-72-4, CAM 17.1, NuMa, K-ras, p-Catenin, CDK4, Mum-1, and pl6. 
Alternatively, the antigen can be an antigen associated with a virus or viral mfection. In still 
35 another embodiment, the antigen is an antigen associated widi non-viral intracellular parasites. 
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The housekeeping epitope preferably encodes a polypeptide of about 6 to about 23 amino 
acids in length. More preferably, the housekeeping epitope encodes a polypeptide of 9 to 10 amino 
acids in lengfli. Advantageously, the housekeeping epitope is specific for at least one allele of 
MHC. The allele can encode type Al, A2, A3, Al 1, A24, A26, A29, B7, B8, B14, B18, B27, B35, 
5 B44,B62,B60,orB51. 

In another aspect of the mvention, the vaccine includes an immune epitope. The unmune 
epitope may be derived from a second antigen associated with the target cell. The first antigen and 
second antigen may be the same or different Preferably, the housekeeping epitope is specific for a 
first allele of MHC and the immune epitope is specific for a second allele of MHC. The first allele 

10 and the second allele may be the same or diiferent. 

In still another aspect of flie present invention, the vaccine with a nucleic acid construct 
additionally includes an epitope cluster. The epitope cluster includes an immune epitope. 
Preferably, the epitope clustCT is derived from a second antigen associated with the target cell. The 
first antigen and the second antigen may be the same or different. 

15 Advantageously, the epitope cluster includes or encodes a polypeptide having a length of 

at least 10 ammo acids and less than about 60 amino acids. In a preferred embodiment, the epitope 
cluster includes or encodes a polypeptide with a length less than about 80% of the length of the 
second antigen. In another preferred embodiment, the length of the polypeptide is less than about 
50 % of the lengtii of the second antigen. In a particularly preferred embodiment, the length of the 

20 polypeptide is less than about 20% of the length of the second antigen. 

In yet another aspect of the present invention, the vaccine including a nucleic acid 
construct fiirdier includes a second housekeeping epitope, wherein the second housekeeping 
epitope is derived from a second antigen associated with a second target cell. The first antigen and 
the second antigen can be the same or different. Preferably, the first target ceD and the second 

25 target cell are different 
Nucleic Acid Constructs 

The invention provides a nucleic acid construct including a first coding region, wherein the 
first coding region includes a first sequence encoding at least a first polypeptide, wherein the first 
polypeptide mcludes a first housekeeping epitope derived from a first antigen associated with a 

30 first target cell. The first coding region can further include a second sequence encoding at least a 
second polypeptide, wherein the second polypeptide includes an second epitope derived from a 
second antigen associated with a second target cell. The first polypeptide and the second 
polypeptide can contiguous or non-contiguous. The second epitope can be a housekeeping epitope 
or an immune epitope. The first antigen and the second antigen can be the same or different; 
35 likewise, the first and second target cells can be the same or different. 
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The target cell can be a neoplastic cell, such as, for example, leukemia, carcinoma, 
lymphoma, astroqrtoma, sarcoma, glioma, retinoblastoma, melanoma, Wilm's tumor, bladder 
cancer, breast cancer, colon cancer, hepatocellular cancer, pancreatic cancer, prostate cancer, lung 
cancer, liver cancer, stomach cancer, cervical cancer, testicular cancer, renal cell cancer, or brain 
5 cancer. The first antigen can be, for example, MART-l/MelanA, gplOO (Pmel 17), tyrosinase, 
TRP-1, TRP.2, MAG&l, MAGE-S, BAGE, GAGE-1, GAGE.2, pl5, NY-ESO, products of an 
SSX gene family member, CT-7, and products of an SCP gene family member. The target cell can 
be infected by a vims such as, for example, adenovirus, cytomegalovirus, Epstein-Barr vmis, 
herpes simplex virus 1 and 2, human herpesvirus 6, varicella-zoster virus, hepatitis B virus, 
10 hepatitis D virus, papillomavirus, parvovirus B19, polyomavirus BK, polyomayinis JC, hepatitis C 
virus, measles virus, rubella virus, human immunodeficiency virus (HTV), human T-cell leukemia 
virus I, or human T-cell leukemia virus H, The target cell can likewise be infected by a bacterium, 
a protozoan, a fimgus, a prion, or any other intracellular parasite, examples of which are 
Chlamydia, Listeria, Salmonella, Legionella, Brucella, Coxiella, Rickettsia, Afycobacterium, 
15 Leishmania, Trypanosoma, Toxoplasma, and Plasmodium, 

The construct typically includes a first promoter sequence operably linked to the fu-st 
codmg region. The promoter can be, for example, cytomegalovirus (CMV), SV40 and retroviral 
long terminal repeat (LTR). The promoter can be a bidirectional promoter, and/or a second 
promoter sequence can be operably linked to a second coding region. The nucleic acid construct 

20 can further include a poly-A sequence operably linked to the &st coding region, the second coding 
region, or both. The nucleic acid construct can also include an internal ribosome entry site (IRES) 
sequence, a ubiquitin sequence, an autocatalytic peptide sequence, enhancers, nuclear import 
sequences, hnmunostunulatory sequences, and expression cassettes for cytokines, selection 
markers, reporter molecules, and the like. ITie first polypeptide can be about 7 to 15 amino acids 

25 in length, and is preferably 9 or 10 amino acids in length. The second polypeptide can be 9 or 10 
amino acids in length, or it can be an epitope cluster between about 1 0 and about 75 amino acids in 
length. The first epitope and second epitopes can bind the same or different alleles of MHC. 

Other embodiments of the invention include a vaccine that includes any of the foregoing 
nucleic acid construct embodiments; a method of treating an animal by administering such a 

30 vaccine; and a method of making the vaccine. 
Identification of Epitope Clusters 

Other embodiments of the invention disclosed herein relate to the identification of epitope cluster 
regions that are used to generate pharmaceutical compositions capable of inducing an immune 
response firom a subject to whom the compositions have been administered. One embodiment of 
35 the disclosed invention relates to an epitope cluster, the cluster bemg derived from an antigen 
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associated with a target, the cluster including or encoding at least two sequences having a known 
or predicted affinity for an MHC receptor peptide binding cleft, wherein the cluster is a fragment 
of the antigen. 

In one aspect of tiie invention, the target is a neoplastic cell. Alternatively, the target may 
5 be a cell infected by an intracellular parasite. The intracellular parasite can be a virus, a bacterium 
or a protozoan. Optionally, the target is a pathogenic agent The pathogenic agent can include a 
virus, a bacterium, a fungus, a protozoan, a prion, a toxin, or a venom. 

In another aspect of the invention, the MHC receptor may be a class I HLA receptor. 
Similarly, the MHC receptor can be a class U HLA receptor. 
^0 In yet another aspect of the invention, the cluster includes or encodes a polypeptide having 

a length, wherein the length is at least 10 amino acids. Advantageously, the length of the 
polypeptide may be less than about 75 ammo acids. 

In still another aspect of the invention, there is provided an antigen having a length, 
v/herein the cluster consists of or encodes a polypeptide having a length, wherein the length of the 
15 polypeptide is less than about 80% of the length of the antigen. Preferably, the length of the 
polypeptide is less than about 50% of the length of the antigen. Most preferably, the length of the 
polypeptide is less than about 20% of the length of the antigen. 

Another embodiment of the disclosed invention relates to a method of identifying an 
epitope cluster including the steps of: providmg a sequence of an antigen associated with a target 
20 cell; scoring candidate peptides within the sequence, based on known or predicted affinity for an 
MHC receptor peptide binding clefi to identify putative MHC epitopes; and identifying a region 
withm the antigen, wherein the region includes at least two of the putative MHC epitopes, and 
v*erein the region comprises a higher density of putative MHC epitopes than a density of putative 
MHC epitopes in the antigen as a whole. 
25 Methods of Treatment 

A method of treating an animal by administering to an animal a vaccine including a first 
housekeeping epitope, wherein the housekeeping epitope is derived fi-om a first antigen associated 
with a first target cell is similarly contemplated by the present invention. Preferably, the 
administering step includes a mode of delivery that is transdermal, intranodal, perinodal, oral, 
30 intravenous, intradermal, intramuscular, intraperitoneal, or mucosal. 

Hie method of treating an animal may additionally include an assaying step to determine a 
characteristic indicative of a state of the target cells. Advantageously, the assaying step may 
further include a first assaying step and a second assaying step, wherein the first assaying step 
precedes the administering step, and the second assaying step follows the administering step. 
35 Preferably, the characteristic determined in tiie first assaying step is compared with the 
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characteristic detramined in the second assaying step to obtain a result. The result can be a 
diminotion in number of target ceils, a loss of mass or size of a tumor comprising target cells, or a 
decrease in number or concentration of an intracellular parasite infecting target cells. 

Preferably, the target cell is a neoplastic cell. The neoplastic cell can be any transformed 
cell associated with solid tumors or lymphomas such as leukemia, carcinoma, lymphoma, 
astrocytoma, sarcoma, glioma, retinoblastoma, melanoma, Wibn's tumor, bladder cancer, breast 
cancer, colon cancer, hepatocellular cancer, pancreatic cancer, prostate cancer. Lung cancer, liver 
cancer, stomach cancer, cervical cancer, testicular cancer, renal cell cancer, and brain cancer. 
Alternatively, the target cell is infected by an intracellular parasite. The intracellular parasite may 
be a virus. The virus can be adenovirus, cytomegalovirus, Epstein-Barr virus, herpes sfanplex virus 
1, herpes simplex virus 2, human herpesvirus 6, varicella-zoster virus, hepatitis B virus, hepatitis D 
virus, papilloma virus, parvovmis B19, polyomavirus BK, polyomavirus JC, hepatitis C virus, 
measles virus, rubella virus, human immunodeficiency virus (HIV), human T cell leukemia virus I, 
or human Tcell leukemia virus H. The intracellular parasite may be a bacterium, protozoan, 
15 fungus, or a prion. Advantageously, the intracellular parasite is Chlamydia, Listeria, Salmonella, 
Legionella, Brucella, Coxiella, Rickettsia, Mycobacterium, Leishmania, Trypanosoma, 
Toxoplasma, md Plasmodium. 

In another aspect of the invention, the antigen is MelanA (MART-I), gplOO (Pmel 17), 
tyrosinase, TRP-1, TRP-2, MAGE-1, MAGE-3, BAGE, GAGE-I, GAGE-2, CEA. RAGE, NY-, 
20 ESQ, SCP-1, HomMel-40, FRAME, p53, H-Ras, HER-2/neu, BCR-ABL, E2A-PRL, H4-RET. 
IGH-IGK, MYL-RAR, Epstein Barr vuus antigens, EBNA, human papillomavirus (HPV) antigens 
E6 and E7, TSP-180, MAGE.4, MAGE-5, MAGE-6, pl85erbB2, pl80erbB-3, c-met, nm-23HI, 
PSA, TAG-72-4, CAM 17.1, NuMa, K-ras, p-Catenin, CDK4, Mum-1, and pl6. Alternatively, the 
antigen is associated with a virus or viral infection. In still another aspect, the antigen is an antigen 
25 associated with non-vu^l intracellular parasites. 

The housekeeping epitope may include or encode for a polypeptide of about 6 to about 23 
amino acids in length. Preferably, the polypeptide is 9 or 10 amino acids in length. The 
polypeptide may be synthetic. The vaccme may additionally include buffers, detergents, 
surfactants, anti-oxidants, or reducmg agents. The housekeeping epitope may advantageously 
30 include a nucleic acid. Preferably, the housekeeping epitope is specific for at least one allele of 
MHC. The allele can encode types Al, A2, A3, All, A24, A26, A29, B7, B8, B14, B18, B27, 
B35, B44, B62, B60, or B5 1 . 

In yet another aspect of the invention, the method of treating an animal ftirther includes an 
immune epitope. The unmune epitope may be derived from a second antigen associated wifli the 
35 target cell. Optionally, the first antigen and the second antigen are the same. The housekeeping 
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epitope can be specific for a first allele of MHC, and the immune epitope can be specific for a 
second aUele of MHC. Hie first allele and the second allele may be the same or different 

Advantageously, the vaccine includes an epitope cluster that includes the unmune epitope. 
Tbs epitope cluster may be derived from a second antigen associated with the target cell. 
5 Optionally, the first antigen and the second antigen are the same. The epitope cluster may mclude 
or encode a polypeptide having a length of at least 10 amino acids and less than about 60 amino 
acids. 

Preferably, the epitope cluster includes or encodes a polypeptide having a length less than 
about 80% of the length of the second antigen. The length of the polypeptide can be less than 
10 about 50% of the length of the second antigen. In still another aspect, the length of the polypeptide 
can be less than about 20% of the length of the second antigen. 

The method of treating an animal may further include a second housekeeping epitope, 
wherehi the second housekeepmg epitope is derived from a second antigen associated with a 
second target cell. The fust antigen and the second antigen may be the same or different 
15 Similarly, the first target cell and the second target cell may be the same or different 

A method of treating an animal includmg administermg to an animal a vaccine comprising 
a nucleic acid construct is also contemplated by the present invention. The nucleic acid construct 
advantageously encodes a housekeepmg epitope. The housekeeping epitope may be derived from ■ 
a first antigen associated with a first target cell. 
20 In another aspect of die mvention there is provided a method of making a vaccine. The 

method includes the steps of selecting a housekeeping epitope by identifying epitopes that are or 
could be produced from a particular antigen source by housekeeping proteasomes wherein the 
housekeeping epitope is derived from a first antigen associated with a first target cell, making a 
vaccine including the housekeeping epitope, and preparing a vaccine composition that includes or 
25 encodes the selected housekeeping epitope. 

The vaccine made in accordance with the aforementioned method is likewise provided by 
the present invention. The vaccine can be administered to treat an animal. Thus, a method of 
treating an animal with the vaccine is similarly contemplated. 
Discoverv of Housekeeping and Other Epitopes 
30 Otiier embodiments of the invention disclosed herein are directed to the identification of 

epitopes that are useful for generating vaccines capable of inducing an immune response from a 
subject to whom the compositions have been administered, particularly those epitopes most useful 
in the vaccine embodiments of the invention. One embodiment of the invention relates to a 
method of epitope discovery comprising the step of selectmg an epitope from a population of 
35 peptide fragments of an antigen associated with a target cell, wherein the fragments have a known 
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or predicted aflBnity for a major histocompatibility complex class I receptor peptide binding cleft, 
wherein the epitope selected corresponds to a proteasome cleavage product of the target cell. 

Another embodiment of the invention relates to a method of discovering an epitope 
comprising the steps of: providing a sequence from a target cell, wherein the sequence encodes or 
5 comprises a protein e;q>ressed in the target cell; identifying a population of peptide fragments of 
the protein, wherein members of the population of peptide fragments have a known or predicted 
afiBnity for a major histocompatibility complex class I receptor peptide binding cleft; selecting the 
epitope from the population of peptide fragments, wherein the epitope corresponds to a product of 
a proteasome active in the target cell. 
10 One aspect of this embodiment relates an epitope discovered by the aforementioned 

method. Another aspect of this embodiment relates to a vaccine comprising the discovered 
epitope. Still another aspect of the invention relates to a method of treating an animal, comprising 
administering to the animal the aforementioned vaccine. 

One embodiment of the disclosed invention relates to a method of epitope discovery 

15 comprising the steps of: providing a neoplastic cell and a sequence, wherein the sequence 
comprises or encodes an antigen associated with the neoplastic cell; identifying a population of 
peptide fragments of the antigen, wherein the population of peptide fragments is predicted to have 
an af3finity for a major histocompatibility complex class I receptor peptide binding cleft; selectmg 
an epitope from the population of peptide fragments, wherein the epitope is determine by in vitro 

20 analysis to be a proteasome cleavage reaction product of a proteasome active in the neoplastic celi. 

One aspect of this embodunent relates an epitope discovered by the aforementioned 
method. Another aspect of this embodiment relates to a vaccine comprising the discovered 
epitope. Still another aspect of the invention relates to a method of treating an animal, comprising 
administering to the animal the aforementioned vaccine. 

25 Another embodiment of the disclosed invention relates to a method of epitope discovery 

comprismg the step of selecting an epitope from a population of peptide Augments of an antigen 
associated with a target m a host, wherein the fragments have a known or predicted afiFmity for a 
major histocompatibility complex class I or II receptor peptide binding cleft of the host, wherein 
the epitope selected corresponds to a product of proteolytic cleavage of the antigen in a cell of the 

30 host. 

One aspect of this embodiment relates an epitope discovered by the aforementioned 
method. Another aspect of this embodiment relates to a vaccine comprising the discovered 
epitope. Still another aspect of the invention relates to a method of treatuig an animal, comprismg 
administering to the animal the aforementioned vaccine; 

35 
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Brief Description of the Drawings 
Figure 1 depicts schematically tiie parts of a cell involved in protein processing by the 
proteasome and epitope presentation. 

Figure 2 is a comparison of the housekeeping proteasome and the immune proteasome. 
5 Figure 3 depicts schematically epitope synchronization between mfected cells and pAPCs. 

Figure 4 shows presentation of different epitopes by pAPCs and tumor cells. 
Figure 5 shows presentation of different epitopes by pAPCs and infected cells. 
Figure 6 depicts presentation by tumor cells of both housekeeping and immune epitopes 
due to induction by IFN-gamma. 
10 Figure 7 shows an attack of virally mfected cells by T ceUs induced to recognize a 

housekeeping epitope. 

Figure 8 shows a dual attack against both housekeeping and immune epitopes. 
Figure 9 is a depiction of the components of plasmid pVAX-EPl-IRES-EP2-ISS-NIS. 
Figure 10 is a depiction of the components of plasmid pVAX-EP2-UB-EPl. 
15 Figure 11 is a depiction of the components of plasmid pVAX-EP2-2A-EPL 

Figure 12 is a depiction of the components of plasmid pVAX-EPl-IRES-EP2. 
Figure 13 depicts the results of a flow cytometry assay verifying HLA binding by Melan-A 
epitopes. 

Figure 14 depicts the results of a flow cytometry assay verifying HLA binding by 
20 Tyrosinase peptide 207-21 6. 

Figure 15 depicts the sequence of Melan-A (SEQ ID NO: 1), showing clustering of class I. 
HLA epitopes. 

Figure 16 depicts the sequence of SSX-2 (SEQ ID NO: 2), showing clustering of class I 
HLA epitopes. 

25 Figure 17 depicts the sequence of NY-ESO (SEQ ID NO: 3), showing clustering of class I 

HLA epitopes. 

Figure 18 depicts the sequence of Tyrosinase (SEQ ID NO: 4), showing clustering of class 
I HLA epitopes predicted by the BIMAS-NIH/Parker algorithm above the line of sequence and by 
the SYFPEITHiyRammensee algorithm below. 
30 Detailed Description of the Preferred Embodiment 

Embodiments of the present invention provide epitopes, vaccines, and therapeutic methods 
for directing an effective immune response against a target cell. A primary basis of the invention 
is the novel and unexpected discovery that many target cells display epitopes that are different 
from the epitopes displayed by professional antigen presenting cells (pAPCs). Because of this 
35 difference, the pAPCs direct T cells agamst epitopes that are not present on the target cells, and the 
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T cells therefore fail to recognize the target cells. The metiiods and medicaments of the present 
invention can cause pAPCs to display the same epitopes that are present on target cells, resulting in 
T cells that are correctly able to recognize and destroy the target cells. 

Embodunents of the invention disclosed herein further provide methods for identifying 
5 epitopes of target antigens tiiat can be used to generate immunologically effective vaccines. Such 
vaccmes can stimulate the inunune system to recognize and destroy target cells displaying the 
selected epitopes. Embodiments of the invention are particularly useful in the treatment and 
prevention of cancers and of mfections of cells by intracellular parasites, as well as in the treatment 
or prevention of conditions associated with oflier pathogens, toxins, and allergens. 

10 Certain kinds of targets are particularly elusive to the munune system. Among these are 

many kinds of cancer, as weU as cells infected by intracellular parasites, such as, for example, 
viruses, bacteria, and protozoans. A great deal of research has been done to identify useful 
antigens and epitopes for generating an effective immune response against such targets, with little 
success. This disclosure provides a basis for the efficient discovery of a new generation of 

IS effective epitopes effective against such elusive targets. 

The invention disclosed herem makes it possible to select epitope sequences with true 
biological relevance. For an epitope to have biological significance, e.g., to fimction m stimulating 
an hnmune response, it must have an affmity for the bmding cleft of a major histocompatibility . 
complex (MHC) receptor peptide. There are various means, known in the art, of predictmg 

20 whether an oligopeptide sequence will have an MHC binding affinity. However, most of the 
sequences predicted to have MHC binding affinity are not biologically relevant, because they are 
not actually presented on the surface of a target cell or a pAPC. 

The melhods of flie disclosed invention permit the vaccine designer to ignore peptides that, 
despite predicted high binding affinity for MHC, will never be useful because they cannot be 

25 presented by target cells. Accordingly, methods and teachings disclosed herein provide a major 
advance in vaccine design, one that combines the power of antigen sequence analysis with the 
fundamental realities of immunology. The methods taught herein allow for the simple and 
effective selection of meaningful epitopes for creation of MHC class I or class II vaccines using 
any polypeptide sequence corresponding to a desired target. 

30 Further embodiments of the invention disclosed herein provide epitope cluster regions 

(ECRs) for use in vaccines and in vaccine design and epitope discovery. Specifically, 
embodiments of the invention relate to identifying epitope clusters for use in generating 
immunologically active compositions directed against target cell populations, and for use m the 
discovery of discrete housekeeping epitopes and hnmune epitopes. In many cases, numerous 
35 putative class I MHC epitopes may exist m a smgle target-associated antigen (TAA). Such 
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putative epitopes are often found in clusters (ECRs), MHC epitopes distributed at a relatively high 
density within certain regions in the amino acid sequence of the parent TAA. Since these ECRs 
include multiple putative epitopes with potential useful biological activity in inducing an immune 
response, they represent an excellent material for in vitro or in vivo analysis to identify particularly 
5 useful epitopes for vaccine design. And, since the epitope clusters can themselves be processed 
inside a cell to produce active MHC epitopes, the clusters can be used directly in vaccmes, with 
one or more putative epitopes in the cluster actually being processed into an active MHC epitope. 

The use of ECRs m vaccines offers important technological advances in the manufacture 
of recombinant vaccmes, and further offers crucial advantages in safety over existing nucleic acid 
10 vaccines that encode whole protein sequences. Recombmant vaccines generally rely on expensive 
and technically challenging production of whole proteins in microbial fermentors. ECRs offer the 
option of usmg chemically syntiiesized polypeptides, greatly sunplifying development and 
manufacture, and obviatmg a variety of safety concerns. Sunilarly, the ability to use nucleic acid 
sequences encoding ECRs, which are typically relatively short regions of an entire sequence, 
15 allows the use of synthetic oligonucleotide chemistry processes in the development and 
manipulation of nucleic acid based vaccuies, rather than the more expensive, time consuming, and 
potentially difficult molecular biology procedures involved with using whole gene sequences. 

Since an ECR is encoded by a nucleic acid sequence that is relatively short compared to . 
that which encodes the whole protein from which the ECR is found, this can greatly improve the . 
20 safety of nucleic acid vaccmes. An important issue in the field of nucleic acid vaccines is the fact 
that the extent of sequence homology of the vaccme with sequences m the animal to which it is 
administered determines the probability of integration of the vaccine sequence into the genome of 
the animal. A fundamental safety concern of nucleic acid vaccines is their potential to integrate 
into genomic sequences, which can cause deregulation of gene expression and tumor 
25 transformation. The Food and Drug Admmistration has advised that nucleic acid and recombinant 
vaccines should contain as little sequence homology with human sequences as possible. In the case 
of vaccines delivering tumor-associated antigens, it is inevitable that the vaccines contain nucleic 
acid sequences that are homologous to those which encode proteins that are expressed in the tumor 
cells of patients. It is, however, highly desirable to limit the extent of those sequences to that which 
30 is minunally essential to facilitate the expression of epitopes for mducing therapeutic immune 
responses. The use of ECRs thus offers the dual benefit of providing a minimal region of 
homology, while incorporating multiple epitopes that have potential therapeutic value. 

Aspects of the present invention provide nucleic acid constmcts tiiat encode a 
housekeeping epitope. A housekeeping epitope, as will be described in greater detail below, 
35 mcludes peptide firagments produced by the active proteasome of a peripheral cell. A basis for the 
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present invention is the discovery that any antigen associated with a target cell can be processed 
differentially into two distinguishable sets of epitopes for presentation by the class I major 
histocompatibility complex (MHC) molecules of the body. "Immune epitopes" are presented by 
pAPCs and, also generally in peripheral cells that are acutely infected or under active 
5 immunological attack by interferon (IFN) secreting cells. In contrast, "housekeeping epitopes" are 
presented by all other peripheral cells including, generally, neoplastic (cancerous) cells and 
chronically faxfected cells. This mismatch, or asynchrony, in presented epitopes underlies the 
persistence and advance of cancers and chronic infections, despite the presence of a functioning 
bnmune system in the host It is thus essential to bring about synchronization of epitope 

10 presentation between the pAPC and the target cell m order to provoke an effective, cytolytic T 
lymphocyte (CTL)- mediated immune response. 

Synchronization can be accomplished most reliably by providing the pAPC with a 
housekeeping epitope. Often a more robust response can be achieved by providing more than a 
single epitope. Additionally, once an effective immune response agamst the target cells has been 

15 established, secretion of BFN may lead to expression of the immune proteasome, thereby switching 
epitope presentation to immune epitopes. For this reason, among others, it can also be 
advantageous to include immune epitopes, in addition to housekeeping epitopes, in vaccines 
developed according to the above referenced disclosure. It can be of further utility to provide 
immune epitopes in the form of an ECR. Embodiments of the invention provide expression . 

20 vectors encoding housekeeping epitopes and/or immune epitopes in a variety of combinations. 
Preferred expression constructs encode at least one epitope capable of stimulating a cellular 
immune response directed against a target cell. In one embodiment of the invention, target cells 
are neoplastic cells. In another embodiment, target cells are any intracellularly infected host cell. 
Intracellular infective agents include persistent viruses and any other infectious organism that has 

25 an intracellular stage of infection. 

The nucleic acid constructs of some embodiments are directed to enhancing a subject's 
immime system and sensitizing it to the presence of neoplastic cells within the host. In other 
embodhnents, the nucleic acid constructs facilitate the eradication of persistent viral infections as 
well as cells infected with mtracellular parasites. 

30 Definitions 

Unless otherwise clear from the context of the use of a term herem, the following listed 
terms shall generally have the indicated meanings for purposes of this description. 

PROFESSIONAL ANTIGEN-PRESENllNG CELL (pAPC) - a cell that possesses T cell 
costimulatory molecules and is able to induce a T cell response. Well characterized pAPCs are 
35 dendritic cells, B cells, and macrophages. 
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PERIPHERAL CELL - a ceU that is not a pAPC. 

HOUSEKEEPING PROTBASOME ^ a proteasome nomally active in peripheral ceils, 
and generally not present or not strongly active in pAPCs. 

IMMUNE PROTEASOME - a proteasome normally active in pAPCs; the immune 
5 proteasome is also active in some peripheral cells in infected tissues. 

EPITOPE - a molecule or substance capable of stimulatmg an immune response. In 
preferred embodiments, epitopes according to this definition include but are not necessarily limited 
to a polypeptide and a nucleic acid encoding a polypeptide, wherein the polypeptide is capable of 
stimulating an immune response. In other preferred embodiments, epitopes according to this 
10 definition include but are not necessarily limited to peptides presented on the surface of cells non- 
covalently bound to the pocket of class I MHC, such that they can interact with T cell receptors. 

MHC EPITOPE - a polypeptide having a known or predicted affinity for a mammalian 
class I major histocompatibility complex (MHC) molecule. 

HLA EPITOPE - a polypeptide having a known or predicted aJBBnity for a human class I 
15 major histocompatibility complex (MHC) molecule. 

HOUSEKEEPING EPITOPE - In a preferred embodiment, a housekeeping epitope is 
defined as a polypeptide fragment that is an MHC epitope, and that is displayed on a cell in which 
housekeeping proteasomes are predominantly active. In another preferred embodiment, a. 
^housekeeping epitope is defined as a polypeptide containing a housekeeping epitope according to . 
20 the foregoing definition, that is flanked by one to several additional amino acids. In another, 
preferred embodiment, a housekeeping epitope is defined as a nucleic acid that encodes a 
housekeeping epitope according to either of the foregoing definitions. 

IMMUNE EPITOPE In a preferred embodunent, an immune epitope is defined as a 
polypeptide fi-agment that is an MHC epitope, and that is displayed on a cell m which immune 
25 proteasomes are predominantly active. In another preferred embodiment, an immune epitope is 
defined as a polypeptide containing an immune epitope according to the foregoing definition, that 
is flanked by one to several additional amino acids. In another preferred embodhnent, an immune 
epitope is defined as a polypeptide including an epitope cluster sequence, having at least two 
polypeptide sequences having a known or predicted affinity for a class I MHC. In yet another 
30 preferred embodiment, an immune epitope is defined as a nucleic acid that encodes an immune 
epitope according to any of the foregoing definitions. 

TARGET CELL - a cell to be targeted by the vaccines and methods of the invention. 
Examples of target cells according to this definition include but are not necessarily Ihnited to: a 
neoplastic cell and a cell harboring an intracellular parasite, such as, for example, a virus, a 
35 bacterium, or a protozoan. 
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TARGET-ASSOCIATED ANTIGEN (TAA) - a protein or polypeptide present in a target 

cell. 

TUMOR-ASSOCIATED ANTIGEN (TuAA) - a TAA, wherein the target cell is a 
neoplastic cell. 

5 Note lhat the following discussion sets forth the inventors' understanding of the operation 

of the invention. However, it is not intended that this discussion limit the patent to any particular 
theory of operation not set forth in the claims. 
Different Proteasomes Yield Different Epitopes 

Epitopes presented by class I MHC on the surface of either pAPCs or peripheral cells are 
10 produced by digestion of proteins within those cells by proteasomes. While it has been reported 
. that the proteasomes of pAPCs are not identical to the proteasomes of peripheral cells, the 
significance of this difference has been heretofore unappreciated. This invention is based on the 
fact that when pAPCs and peripheral cells process a given TAA, the proteasomes active in the 
pAPCs generate epitope fragments that are different from the epitope fragments generated by the 

15 proteasomes that are active m the peripheral cells. For convenience of reference, and as defined 
above, tiie proteasomes that are predominantly active m pAPCs are referred to herein as "immune 
proteasomes** while the proteasomes that are normally active in peripheral cells are referred to 
herein as "housekeeping proteasomes,*' 

The significance of the differential processing of TAAs by pAPCs and peripheral cells 

20 cannot be overstated. This diiSerential processing provides a unified explanation for why certain 
target cells are resistant to recognition and attack by the immune system. Although pAPCs can 
take-up TAAs shed from target cells and present them on their surface, the pAPCs will 
consequently stimulate the production of CTLs to recognize an "immune epitope" (the epitope 
resulting from processing of the TAA by the immune proteasome), whereas the target cells display 

25 "housekeeping epitopes" (distinct fragments of the TAA generated by the housekeeping 
proteasome). As a consequence, the CTL response under physiological conditions is misdirected 
away from epitopes on the target cells. 

Since CTL responses are induced by pAPCs, by definition they target unmune epitopes 
rather than housekeepmg epitopes and thus fail to recognize target cells, which are therefore able 

30 to persist in the body. This fundamental "epitope compartmentalization" of Ae cellular immune 
response is the reason that some neoplastic cells can persist to form tumors; it is also the reason 
that some viruses and intracellular parasites can chronically infect cells without being eradicated 
by the immune system. With regard to infectious agents, normally they cause the expression of 
immune proteasomes in the cells they mfect This results in the production of epitopes on the cell 

35 surface that are identical to those being presented by pAPCs to the inunune system. Infection thus 
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results in "epitope synchronization" between ttie immune system and the infected cell, subsequent 
destruction of the infected cells, and clearance of infectious agent from the body. In the case of 
some infectious agents, notably those that are capable of establishing chronic infections, they have 
evolved a means of preventing expression of immune proteasomes in the cells they infect The 
5 proteasome in these cells are mamtdned in a housekeepmg mode, thereby preventing epitope 
synchronization and attack by CTL, There is substantial evidence that this is a common mechanism 
used by virtually all chronic infectious agents. 

One way to overcome this failure on the part of CTLs to recognize and eradicate certain 
target cells is to provide vaccines and treatment methods that are capable of "synchronizing'* 

10 epitope presentation. Epitope synchronization in this context means that the pAPCs are made to 
present housekeeping epitopes, resultmg in CTLs that can recognize the housekeeping epitopes 
displayed on target cells, and thereby attack and elhninate the target cells. 

Accordingly, embodiments of the mvention are useful for treating neoplastic diseases 
includmg solid tumors and lymphomas. Additional embodiments of the mvention have application 

15 in treating persistent viral infections as well as parasitic infections in which the infective agent has 
an intracellular stage of infection. Appropriate administration of housekeeping epitopes 
corresponding to such target cells can activate a specific, cytotoxic T cell response against the 
target cells. 

The Role of Epitope Differences in Cancer 

20 In some embodiments, the present invention is directed to treating neoplastic diseases. 

Cancers are caused by the progressive, unregulated growth of the progeny of a single abnormal 
cell. The term "cancer" as used herein includes neoplastic diseases, neoplastic cells, tumors, tumor 
cells, malignancies and any transformed cell, includmg both solid tumors and difluse neoplastic 
disease. Historically, cancer cells generally have been thought to escape detection and destruction 

25 by the immune system because cancer cells contain the same genetic material as other non- 
cancerous cells of the body. The genetic identity or similarity of cancer cells and healthy cells in 
the body supposedly causes the difficulty of distinguishing cancer cells from normal cells, and the 
immune system is therefore unable to mount an effective immune response, as evidenced by the 
persistence of cancer cells in the body. 

30 To the contrary, a variety of tumor associated antigens (TuAAs) have been described 

which could, and indeed do, provoke immune responses. Numerous studies have described tumor 
infiltrating lymphocytes (TBLs) which can kill target cells presenting peptides derived from various 
TuAAs in vitro. As is described in further detail below, however, the failure of TILs to control 
cancer results from a difference in the epitopes produced and presented by the cells which mduce 
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CTL activity, the pAPC, and the desired target cells, i.e., those of the tumor. To understand the 
difference, it is necessary to understand the functions and dynamics of proteasomes. 

All cells contam proteasomes to degrade proteins. These proteasomes, which comprise 
about 1 % of the total protem content of the cell, serve to regulate protein half-life in the cell. In 
5 tiie course of protem degradation, proteasomes generate the vast majority of peptide fragments 
involved m Class I antigen presentation, and the proteasome cleavage patterns affect the 
availability of antigenic epitopes for presentation on Class I molecules (Figure 1). Thus MHC 
epitopes are produced by the proteasomal activity of cells. However, the proteolytic activity in 
pAPCs, as compared to peripheral cells, is markedly different. The pAPCs contain a proteasome 

10 that constitutively mcorporates subunits that are typically only expressed in peripheral cells during 
infection or after exposure to various cytokines, particularly interferon (IFN), as part of a cellular 
immune response. As set forth above, the different proteasomal activities of pAPCs and peripheral 
cells are referred to herein as inmiune and housekeeping proteasomes, respectively. 

The hnmune and housekeepmg proteasomes have the capacity to cleave proteins at similar 

15 but distmct locations. The unmune proteasome mcorporates several subunits that distmguish it 
from its housekeepmg counterpart. These unmune subunits mclude LMP2, LMP7, and MECLl, 
which replace the catalytic subunits of the housekeeping proteasome, and PA28a and PA28B, 
which serve a regulatory function (Figure 2). Collectively, incorporation of these subunits results 
in activity from the inamune proteasome that is qualitatively and quantitatively different from the. 

20 activity of the housekeeping proteasome. Although evidence has existed that there are differences 
between housekeepmg and unmune proteasomes with respect to the MHC epitopes they produce, 
until now these differences have been rationalized m quantitative terms. It has been suggested by 
others that the ultimate effect mediated by the immune proteasome is to facilitate the production of 
more peptides, rather than different ones. 

25 Qualitative differences in antigen processing between immune and housekeepmg 

proteasomes have serious unpUcations for vaccine design. IFN-y is produced by T lymphocytes, 
where it is mvolved in promoting the induction of cellular immune responses and, as noted above, 
mduces expression of the unmune proteasome. Notably, IFN is also produced by vutually any 
other cell under one condition: in the event that the cell becomes infected by a pathogen. In 

30 nature, vhal infection typically causes IFN production by the infected cell, which m turn induces 
the cell to convert from a housekeepmg proteasome configuration to an immune proteasome 
configuration. One explanation for this phenomenon is that the infection and subsequent IFN up- 
regulation serves to align the mfected cell, in terms of the displayed antigen repertohe, with that of 
the pAPCs mvolved m stunulatmg the unmune response against the virus. This results in the 

35 processing of both its endogenous "self' proteins, expressed normally by the ceU, and those 
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proteins related to the infectioiis agent ("non-self') in an identical manner to antigen processing 
occurring in the pAPCs. The conversion of the infected cell's proteasome from a housekeeping 
configuration to an immune configuration results in "epitope synchronization" between infected 
cells and the pAPC. (Figure 3). 
5 MHC class-I-restricted CTLs specific for TuAAs are an important component of the 

immune response against cancer. TuAAs are useful targets of a tumor-specific T cell response to 
the extent that they are not displayed on the surface of normal cells, or are overexpressed by the 
tumor cells, or are otherwise strongly characteristic of tumor cells. Numerous TuAAs are known 
and are readily available to those of skill in the art m the literature or commercially. 
10 Indeed some tumors have been found to be defective m IFN-y induction of the immune 

proteasomes. In these situations, it is likely that the C1L are targeting immune epitopes Jfrom 
TuAAs that have been processed by pAPC. Despite the high numbers of CTL in these patients 
specifically activated against these immune epitopes, the CTL fail to find the epitope on the cancer 
cells. The disease progresses and eventually the accumulating CTL, unable to locate the target, 

15 become dysfunctional (Lee, et al. Nature Medicine (1999) 5[6]:677-685). By providmg the pAPC 
with housekeeping epitopes, one can synchronize the epitope presentation by pAPCs with the 
epitope presentation by the tumor, and activate a CTL population that recognizes Aose 
housekeeping epitopes present on the tumor. 

Thus, the discovery that the unmune proteasomes in pAPCs produce qualitatively different 

20 epitopes than do the housekeeping proteasomes m peripheral cells provides an explanation for why 
TILs do not eradicate tumor cells. The processmg mechanism described above explains how T 
lymphocytes find their way into tumor masses, and yet are relatively ineffective against the tumor 
cells themselves. Differential antigen processing between the immune proteasome of pAPCs and 
flie housekeeping proteasome of tumor cells can explain the observation of high frequencies of T 

25 lymphocytes specific against TuAAs in patients with progressive cancer. Lee, et al. Nature 
Medicine (1999) 5[6]:677-685. (Figure 4). 

Due to differences in proteasome activity, peripheral target cells, including tumor cells, 
and some cells infected by a virus or other intracellular parasite (all of which express the 
housekeeping proteasome), necessarily display different epitope signals than the epitope signals 

30 that T cells are conditioned by pAPCs to recognize. In view of this discovery, a compelling 
immunoregulatory role for the proteasome emerges. This discovery provides a key to 
manipulating the immune system, particularly the pAPCs, in order to induce an effective and lethal 
cell-mediated attack of target cells. 

Differential antigen processing explains why CTLs specific for TuAAs are often found 

35 among TILs without eradication of the disease. T lymphocyte responses are primed against TuAA 
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that have been processed by the pAPC. CTLs found among TILs are hopelessly targeting class I 
TuAAs that were present on the pAPC, but not on the tumor cells (Figure 4). 

The behavior of tumor cells in the body, namely migration, antigen shedding, induction of 
inflammatory responses, etc., results in strong immune responses. Unfortunately, the natural 
5 mechanism of diflfereutial antigen processing between tumor cells and pAPCs results in epitope 
isolation of the tumor— that is, the tumor has a different epitope signature than that of the pAPCs 
fihat process the TuAA, and thus the tumor epitopes are "isolated" ftom the epitopes that flie CTLs 
are induced by the pAPCs to recognize. The ability to predict and counter this epitope isolation 
effect is crucial for development of a new generation of therapeutic cancer vaccines. Overcoming 

10 epitope isolation results in epitope synchronization. 

The Role of Epitope Differences in Infections bv Viruses and other Intracellular Parasites 

A wide variety of mechanisms are employed by persistent pathogens in order to establish 
chronic infections m the host organism. A common hallmark is reduced or altered antigen 
expression. In some embodiments, the present invention is directed to the treatment and 

15 ^ prevention of intracellular infection by various pathogens. Examples of such pathogens include, 
but are not limited to: any viruses, bacteria, protozoa, prions or other organisms that have an 
intracellular stage of infection in the host. 

Viral antigen presentation by the pAPCs begins with the digestion of viral antigens into 
peptides by the proteasome. After the proteasome digests the protein into peptides, some of the , 

20 peptides are loaded onto the class I complex in the endoplasmic reticulum and transported to the 
cell surface. At the cell surface, the class I-peptide complex is recognized by T cell receptors on 
the surface of CTLs and the infected cells are killed. 

Herpes viruses and retrovmises escape detection and subsequent eradication by the host's 
inmiune system via restricted viral gene expression. Other mechanisms by which certain viruses 

25 may elude the immime system have also been proposed, including "immunologically privileged" 
sites of viral infection and antigenic variation in key viral peptides. While these models may 
explain the persistence of certain viruses, the concept of epitope synchronization, or conversely, 
epitope compartmentalization, provides a solution. Namely, this concept provides a basis for 
vaccines to direct an effective cellular immune response against any virus or other intracellular 

30 parasite that eludes the immune system by blocking unmune proteasome expression in the host 
cells, or otherwise preventing effective epitope synchronization between infected cells and the 
pAPCs. (Figure 5). 

Smce infection of any cell by a pathogen usually causes the mfected cell to produce IFN, 
the proteasome in infected tissue typically switches from the housekeeping configuration to an 
35 unmune configuration. Infection thus has the effect of aligning the mfected cell, in terms of the 
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antigen repertoire it displays on its surface, with that of the pAPCs involved in stimulating the 
immune response against the virus or other intracellular pathogen. When virally infected cells or 
parasiticaily infected cells are mduced to express an unmune proteasome, rather than the 
housekeeping proteasome, the result is "epitope synchronization" between the infected cells and 
5 the pAPCs, and subsequent eradication of the mfected cells by CTL. 

However, certam vhnses and other mtracellular parasites may escape T cell recognition by 
down-regulating flie expression of host molecules necessary for efficient Tcell recognition of 
infected cells. There is evidence that suggests that many chronic viral mfections mterfere with the 
IFN cascade. (See Table 1). Therefore, because of the role of housekeeping proteasomes, immune 

10 proteasomes, and epitope compartmentalization, in many chronic infections, some embodiments of 
the mvention are also applicable to the design of vaccines for any relevant mtracellular parasite, 
including but not limited to viruses, bacteria, and protozoa. All intracellular parasites are targets 
for such vaccine design. These mclude but are not lunited to: viruses such as adenovirus, 
cytomegalovirus, Epstein-Barr virus, herpes simplex virus 1, herpes simplex virus 2, human 

15 herpesvmis 6, varicella-zoster vmis, hepatitis B virus, hepatitis D virus, papilloma vmis, 
parvovmis B19, polyomavmis BK, polyomavirus JC, hepatitis C virus, measles virus, rubella 
virus, human immunodeficiency virus (HTV), human Tcell leukemia virus I, and human Tcell 
leukemia virus H; bacteria such as Chlamydia, Listeria, Salmonella, Legionella, Brucella, Coxiella, • 
Rickettsia, Afycobacterium; and protozoa such as Leishmania, Trypanosoma, Toxoplasma, and. 

20 Plasmodivm, 



Table 1 



Virus 


Viral Gene Product 


Mechanism of IFN 

(Immune Proteasome) Inhibition 


Adenovirus 


EIA protein 


Blocks signal transduction . 




VARNA 


Blocks activation of PKr 


Cowpox virus 


crmA protein 


Serpin, protease mhibitor blocks 
activation ofll-lp 


Epstein-Barr virus 


EBNA-2 protein 


Blocks signal transduction 




EBERRNA 


Blocks activation of Pkr 




BCRFL (VIRAL IL-10) 


IL-10 HOMOLOG (inhibits Ife-y, 
Il-l, 11-2, Tnf synthesis) 


Hepatitis B virus 


Terminal protein 


Blocks signal transduction 


Human immuno- 
DejBciency virus 


Tat protein 


Reduces activity of Pkr 




TARRNA 


Blocks activation of Pkr 
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Virus 


Viral Gene Product 


Mechanism of IFN 

(Immune Proteasome) Inhibition 


Herpes simplex Virus 


Unknown 


Blocks activation of RNase L 


Influenza virus 


NSI 


Binds dsRNA, blocking Pkr 
Activation 




Unknown 


Activates cellular inhibitor of Pkr 
called p58 


Myxomavirus 


M-T7 protein 


Soluble Ifh-y receptor (decoy) 




T2 protein 


Soluble Tnf receptor (decoy) 


Poliovirus 


Unknown 


Activates cellular mhibitor of Pkr 


Reovirus 


Sigma 3 protein 


RtTld<s cfsR'N^A fkXxA hlru^lrc flrtivsifirh-n 

ofPkr 




K31 protein 


Pkr pseudosubstrate 




B15R protein 


Soluble 1 1-1 receptor (decoy) 




A18R protein 


Regulates dsRNA production 



Vaccmes and Methods for Achieving Epitope Synchronization 

As has been discussed herein, effective cellular immunity is based on synchronized epitope 
presentation between the pAPCs and the infected peripheral cells. In the absence of epitope 
5 synchronization, target cells are not recognized by T cells, even if those T cells are directed against 
TAAs. Cancer cells and cells harboring persistent mtracellular parasites elude the cellular immune 
response because they avoid epitope synchronization. ''Natural*' epitope synchronization involves 
activation of unmune proteasomes in infected cells so that the infected cells display immune 
epitopes and are thns recognized by T cells induced by pAPCs. Yet cancers and cells mfected by 

10 persistent intracellular parasites do not have active immune proteasomes and thus go unrecognized 
by the normal array of mduced T cells. 

The vaccmes and methods of preferred embodiments of the present invention thus 
represent, essentially, a "reverse" epitope synchronization, causing the pAPCs to display 
housekeeping epitopes to address situations in which target cells do not display immune epitopes. 

15 (Figures 6 and 7). Certain embodiments also provide a second wave of epitope synchronization by 
mducing pAPCs to display both housekeeping epitopes and immune epitopes corresponding to a 
selected target cell. Thus, in these dual epitope embodunents, once the target cells are effectively 
attacked by T ceUs that recognize housekeeping epitopes, a switch by the target cells to immune 
proteasome processing does not result in a loss of immune recognition. This is because of the 

20 presence of the immune epitope in the vaccine, which acts to induce a population of T cells that 
recognize unmune epitopes. 
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Preferred embodiments of the present invention are directed to vaccines and methods for 
causing a pAPC or population of pAPCs to present housekeeping epitopes that correspond to the 
epitopes displayed on a particular target cell. In one embodiment, the housekeeping epitope is a 
TuAA epitope processed by the housekeeping proteasome of a particular tumor type. In another 
5 embodiment, the housekeeping epitope is a virus-associated epitope processed by the housekeeping 
proteasome of a cell mfected with a virus. This facilitates a specific T cell response to the target 
cells. Concurrent expression by the pAPCs of multiple epitopes, corresponding to different 
induction states (pre- and post-attack), can drive a CTL response effective against target cells as 
they display either housekeeping epitopes or immune epitopes. (Figure 8). 
1 0 By having both housekeeping and immune epitopes present on the pAPC, this embodiment 

can optimize the cytotoxic T cell response to a target cell. With dual epitope expression, the 
pAPCs can continue to sustain a CTL response to the immune-type epitope when the tumor cell 
switches from the housekeepmg proteasome to the immune proteasome with induction by IFN, 
which, for example, may be produced by tumor-infiltrating CTLs. 

15 In a preferred embodiment, immunization of a patient is with a vaccine that includes a 

housekeeping epitope. Many preferred TAAs are associated exclusively with a target cell, 
particularly in the case of infected cells. In another embodunent, many preferred TAAs are the 
result of deregulated gene expression in transformed cells, but are found also in tissues of the 
testis, ovaries and fetus. In another embodiment, useful TAAs are expressed at higher levels in the 

20 target cell than in other cells. In still other embodiments, TAAs are not differentially expressed in 
the target cell compared to other cells, but are still useful since they are involved in a particular 
function of the cell and differentiate the target cell from most other peripheral cells; in such 
embodiments, healthy cells also displaying the TAA may be collaterally attacked by the induced 
Tcell response, but such collateral damage is considered to be far preferable to the condition 

25 caused by the target cell. 

When neoplastic cells are the target, preferred antigens include TuAAs. Examples of 
protein antigens suitable for use include differentiation antigens such as MelanA (MART-I), gplOO 
(Pmel 17), tyrosinase, TRP-1, TRP-2, and tumor-specific multilineage antigens such as MAGE-1, 
MAGE-3, BAGE, GAGE-1, GAGE-2, CEA, RAGE, NY-ESO, SCP-1, Hom/Mel-40 and FRAME. 

30 Similarly, TuAAs include overexpressed oncogenes, and mutated tumor-suppressor genes such as 
p53, H-Ras and HER-2/neu. Additionally, unique TuAAs resulting from chromosomal 
translocations such as BCR-ABL, E2A-PRL, H4-RET, IGH-IGK, MYL-RAR and viral antigens 
such as Epstein Barr virus antigens EBNA, and the human papillomavirus (HPV) antigens E6 and 
E7 are included. Other useful protein antigens include but are not limited to TSP-180, MAGE-4 

35 MAGE-S, MAGE.6, pl85erbB2, pl80erbB-3, c-met, nm-23Hl, PSA, TAG-72-4, CAM 17.1, 
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NuMa, K-ras, p-Catenin, CDK4, Mum-1, and pl6. These and other TuAAs and pathogen-related 
antigens are known and available to those of skill in the art m the literature or commercially. 

In a further embodiment, the TAA is an antigen specific for a virus. See Table 2. In yet 
another embodiment of the present invention, the TAA is an antigen specific for a non-viral 
5 intracellular parasite. Examples of parasite-specific antigens include nucleotides, proteins, or other 
gene products associated with the intracellular parasite . Suitable nucleotides or proteins can be 
found at the NCBI Taxonomy Database located at 
http:/Avww.ncbi.nlm.nih.gov/Taxonomy/tax.html/ . More detailed descriptions of gene products 
for parasites and other pathogens are provided at this web site. 

10 

Table 2 



Virus 


Candidate Gene Products 


Herpes Simplex I 


ICP4, VP16, ICPO, yl^, gl3 


EBV 


ZTA, EBNA-2, EBMA-l, LMP-1, LMP.2, LMP.2a, LMP-2b 




EBNA-3, EBNA-4, EBNA-LP, EBNA-3A, 3C, BZLF-1 


Poxvirus 


VeTF, K3L, p37, A14L, A13L, A17L, A18R 


SV40 


Large T antigen. Small T antigen, VPZ 


Adenovirus 


ElA, E3L, BIB, E4 (0EF6), E4 (ORFl), gpl9K, ADP, RIDo, RIDp 


Hepatitis B 


pX,L(pre-SO 


Htlv-1 


Tax 


HTV 


TAT, GAG, MA, ENV, TM, NEF, VIF, VPR, REV, VPX 


Hepatis B 


NS5A 


Reovirus 


8-3 


Rous Sarcoma Virus 


pp60'" 


Harvey Sarcoma Virus 


p21~ 


HPV 


E6, E7, E5 


Polyomavirus 


LT,mT,sT 



The compounds and methods described herein are effective in any context wherein a target 
IS cell displays housekeeping epitopes. Methods of discovering effective epitopes for use in 
connection with the vaccine and treatment embodiments of this mvention are disclosed herein. 
Proteolytic Processing of Antigens 

Epitopes that are displayed by MHC on target cells or on pAPCs are cleavage products of 
larger protein antigen precursors. For MHC I q)itopes, protein antigens are digested by 
20 proteasomes resident in the cell. See Figure 1. Intracellular proteasomal digestion ^ically 
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produces peptide firagments of about 3 to 23 amino acids in length. Additional proteolytic 
activities witfain liie cell, or in the ejctracellular milieu, can trim and process these fragments 
further. Processing of MHC II epitopes occurs via intracellular proteases from the 
lysosomal/endosomal compartment. 
5 Presumably, most products of protein processing by proteasomes or other protease 

activities have litfle or no affinity for the binding cleft of a particular MHC receptor peptide. 
However, the processing products that do have such an affinity are likely to be presented, at some 
level of abundance, by MHC at the cell surface. Conversely, if a given oligopeptide sequence does 
not emerge intact from the antigen processing activities of the cell, it cannot be presented at the 

1 0 cell surface, regardless of the predicted affinity of tiie sequence for MHC. 

Vaccine design that focuses entirely on MHC affinity is fundamentally flawed. The mere 
fact that a peptide has MHC binding affinity does not ensure that such a peptide will make for a 
fimctional immimogen. To provide an epitope capable of eliciting an effective unmune response 
against a TAA, the peptide must have MHC binding affinity and be the product of cellular peptide 

15 generating systems. The metiiods of the disclosed invention utilize both MHC binding affinity 
analsrsis and antigen processmg analysis protocols to identify new epitopes of interest 
Correlating Pred icted or Known MHC Binding with Proteolytic Processing of Antigens 

To identify epitopes potentially effective as immunogenic compounds, predictions of MHC • 
bmding alone generally are disadvantageous, because many fragments with predicted binding are 

20 never actually formed in the cell. Embodiments of the mvention combine an analysis of MHC 
bindmg with an analysis of proteolytic processmg to identify epitopes that have both of the 
essential properties of a useful epitope: MHC affinity and correct proteolytic processing. Peptides 
having both of these properties are strong candidates for vaccines and immunotherapies. Peptides 
lacking either of these properties are unlikely to have any significant opportunity to function as 

25 effective epitopes. 

Embodunents of the mvention are capable of identifying epitopes derived bom TAAs for 
use in vaccines. Hie target antigens can be derived from neoplastic cells, cells infected with a 
virus or other intracellular parasite, or cells infected with other pathogenic agents such as bacteria, 
fungi, protozoans, viruses, prions, toxins, venoms, allergens, and the like. In short, embodiments 

30 of the method can be applied to virtually any protein sequence, to identify therein epitopes capable 
of generation by proteolysis and capable of binding to MHC. Accordingly, the invention is not 
limited to any particular target or medical condition, but instead encompasses discovery of 
biologically relevant MHC epitopes from any tiseflil source. 

In a preferred embodiment, the TAA is characteristic of a neoplastic cell and is thus 

35 defined as a tumor-associated antigen (TuAA). Preferred TuAAs include: differentiation antigens 
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such as MelanA (MART-1), gplOO (Pmel 17), tyrosinase, TRP-l, lllp.2, MAGE-1, MAGE-3, 
BAGBy GAGE-1, GAGE-2, pl5; overexpressed embiyonic antigens generally; overexpressed 
oncogenes and mutated tumor-suppressor genes such as p53, Ras, HER-2/neu; unique tumor 
antigens resulting from chromosomal translocations such as BCR-ABL, E2A-PRL, H4-RET, IGH- 
5 IGK, MYL-RARl and vu^ antigens, such as Epstein Barr virus antigens (EBVA) and the human 
papillomavirus (HPV) antigens E6 and E7, Otiier antigens of interest include prostate specific 
antigen (PSA), prostate stem cell antigen (PSCA), MAAT-1, GP-lOO, TSP-180, MAGE-4, MAGE- 
5, MAGE-6, RAGE, pl85erbB-2, pl85erbB-3, c-met, nm-23Hl, TAG-72, CA 19-9, CA 72-4, 
CAM 17.1, NuMa, K-ras, -Catenin, CDK4, Mum-1, pl5, and pl6. Other target antigens are also 
10 contemplated. 

A variety of methods are available and well known in the art to identify TuAAs, Examples 
of these techniques mclude differential hybridization, including the use of microarrays; subtractive 
hybridization cloning; differential display, either at the level of mRNA or protein expression; EST 
sequencmg; and SAGE (sequential analysis of gene expression). TTiese nucleic acid techniques 

15 have been reviewed by Carulli, J J. et al J. Cellular Biochem Suppl. 30/31:286-296, 1998. 
Differential display of proteins involves, for example, comparison of two-dimensional 
polyacrylamide gel electrophoresis of cell lysates from tumor and normal tissue, location of protein 
spots unique or overexpressed in the tumor, recovery of the protein from the gel, and identification 
of the protein using traditional biochemical or mass spectrometry sequencing techniques. An 

20 additional technique for identification of TuAAs is the SEREX technique, discussed in Tureci, 0., 
Sahin, U., and Pfreundschuh, M., "Serological analysis of human tumor antigens: molecular 
deJBnitibn and unplications". Molecular Medicine Today, 3:342, 1997. Use of these and other 
methods provides one of skill in the art the techniques necessary to identify useful antigens for 
generatmg housekeeping and fanmune class I epitopes, as well as class II epitopes for a vaccines. 

25 However, it is not necessary, in practicmg the invention, to identify a novel TuAA or TAA. 
Rather, embodiments of the invention make it possible to identify useful epitopes from any 
relevant protein sequence, whether the sequence is already known or novel. 
Analysis of TAA Fragments for MHC Binding 

In order to identify biologically relevant epitopes, fragments withui the TAA with a known 

30 or predicted affinity for MHC are identified. The amino acid sequence of a TAA can be analyzed 
by a number of different techniques with which to identify peptide fragments having a known or 
predicted affinity for the MHC peptide binding cleft. In one embodiment of the invention, TAA 
fragments are analyzed for theh predicted ability to bind to the MHC peptide binding cleft usmg a 
computer algoritfmi. Each allele of MHC specifies a particular epitope binding domain. Thus, for 

35 any given MHC allele, the candidate peptides can be screened for predicted affinity thereto. 
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Examples of suitable computer algorithms for this purpose include that found at the world wide 
web page of Hans-Geoi^g Rammensee, Jutta Bachmann, Niels Emmerich, Stefan Stevanovic; 
SYFPEITHI: An Internet Database for MHC Ligands and Peptide Motifs (http://access via: 
syj^eitM.bmi-heidelberg.com/scripts/MHCServer.dll/home^ Results obtained from this 

5 method are discussed in Rammensee, et al., "MHC Ligands and Peptide Motifs,** Landes 
Bioscience Austin, TX, 224-227, 1997. Another http:// site of interest is 
"bimas.dcrt.nih.gov/molbio/hla_bmd," which also contains a suitable algorithm. The methods of 
this web site are discussed in Parker, et al., "Scheme for ranking potential HLA-A2 binding 
peptides based on independent binding of individual peptide side-chams," J. Immunol. 
10 152:163-175. 

Using the NIH (Parker) algorithm with the methods of the invention would select peptides 
using a number of possible retention times to indicate a binding sequence. In one embodiment, 
peptides with an infinite retention time would be selected. In another embodiment, peptides with a 
retention time of 25 minutes or more would be selected to indicate a binding sequence. In still 
15 another embodiment, a retention time of 15 minutes or more would be selected to indicate a 
binding sequence. In still another embodiment, a retention time of 10 minutes or more would be 
selected to indicate a binding sequence. Retention times of 9, 8, 7, 6, 5, 4, 3, 2, and 1 minute are 
also contemplated. 

As an alternative to predictive algorithms, a number of standard in vitro receptor binding 
20 affinity assays are available to identify peptides having an affinity for a particular allele of MHC. 
Accordingly, by the method of this aspect of the invention, the initial population of peptide 
fragments can be narrowed to include only those peptides having an actual or predicted affinity for 
tfie selected allele of MHC. 

Initially, peptide candidates for this analysis can include every possible sequence of about 
25 6 to 24 contiguous amino acids from the entire protein sequence of the TAA. In a preferred 
embodiment, the sequences can be from about 7 to 20 amino acids in length. In a more preferred 
embodiment, the sequences can be from about 8 to 15 amino acids in length. For sequence 
analysis to identify fragments with predicted affinity for MHC I, a most preferred embodiment 
analyzes all possible sequences of 9 or 10 contiguous amino acid fragments of the TAA. Analysis 
30 of the MHC affinity of the fragments can be conducted in vitro or via computer analysis of the 
fragments. 

Selected common alleles of MHC I, and their approximate frequencies, are reported in the 
tables 3-5 below. 
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Table 3. Estimated gene frequencies of HLA-A antigens 



CAU 



Gf 



AFR 



Gf 



SE 



ASI 



Gf 



SE 



LAT 



Gf 



SE 



NAT 



Gf 



SE 



A2 
A3 
A28 
A36 



15.1843 

28.6535 

13.3890 

4.4652 

0.0221 



0.0489 
0.0619 
0.0463 
0.0280 
0.0020 



5.7256 

18.8849 

8.4406 

9.9269 

1.8836 



0.0771 
0.1317 
0.0925 
0,0997 
0.0448 



4.4818 

24.6352 

2.6454 

1.7657 

0.0148 



0.0846 
0.1794 
0.0655 
0.0537 
0.0049 



7.4007 

28.1198 

8.0789 

8.9446 

0.1584 



0.0978 
0.1700 
0.1019 
0.1067 
0.0148 



12.0316 

29.3408 

11.0293 

5.3856 

0.1545 



0.2533 
0.3585 
0.2437 
0.1750 
0.0303 



A23 
A24 

A9 unsplit 
A9 total 



I. 8287 
9.3251 
0.0809 

II. 2347 



0.0181 
0.0395 
0.0038 
0.0429 



A25 



2.1157 



10.2086 
2.9668 
0.0367 
13.2121 



0.1010 
0.0560 
6.0063 
0.1128 



0.3256 
22.0391 
6.0858 
22.4505 



0.0231 
0.1722 
0.0119 
0.1733 



2.9269 
13.2610 
0.0537 
16.2416 



0.0628 
0.1271 
0.0086 
0.1382 



1.9903 
12.6613 
0.0356 
14.6872 



0.1080 
0.2590 
0,0145 
0.2756 



0.0195 



0.4329 



0.O216 



0.0990 



0,0128 



.1937 



0.O4O4 



1.4520 



0.0924 



A26 
A34 
A43 
A66 

A10i]nsplit|0 
AlO total 



3.8795 
0.1508 
0.0018 
0.0173 
,0790 
6.2441 



0.0262 
0.0052 
0.0006 
0.OO18 
0.0038 
0.0328 



2.8284 
3.5228 
0.0334 
0.2233 
0.0939 
7.1348 



0.0547 
0.0610 
0.0060 
0,0155 
0.0101 
0.0850 



[4.6628 
1.3529 
0.0231 
0.0478 
0.1255 
6.3111 



0.0862 
0.0470 
0.0062 
0.0089 
0.0144 
0.0993 



3.2612 
0.4928 
0.OO55 
0.0399 
0.0647 
5.0578 



0.0662 
0,0260 
0.0028 
0.0074 
0.0094 
0.0816 



2.4292 
0.3150 
0.0O59 
0.0534 
0.O298 
4,2853 



0,1191 
0.0432 
0.0059 
0.0178 
0.0133 
0.1565 



A29 
A30 
A31 
A32 
A33 
A74 

A19 unsplit 0 
A19 total 



3.5796 
2.5067 
2.7386 
3.6956 
1.2080 
0.0277 
•.0567 
13.8129 



0.0252 
0.0212 
0.0221 
0.0256 
0.0148 
0.0022 
0.0032 
0.0468 



13.2071 

13.0969 

1.6556 

1.5384 

16.5607 

1.9949 

0.2057 

28.2593 



0.0582 
0.1129 
0.0420 
0.0405 
0822 
0461 
0.0149 
0.1504 



1.1233 
22025 
3.6005 
1.0331 
9.2701 
0.0561 
0.0990 
17.3846 



0.0429 
0.0598 
0,0761 
0.0411 
0,1191 
0096 
0.0128 
0.1555 



4.5156 
4.4873 
4.8328 
2.7064 
2.6593 
0.2027 
0.1211 
19.5252 



0.0774 
0.0772 
0.0800 
0.0604 
0.0599 

0167 
0.0129 

•.1481 



3.4345 
2.5314 
6.0881 
2.5521 
1.0754 
0.1068 
0.0475 
15.8358 



M410 
10.1215 
1.1855 
1.1220 
•.0796 
.0252 
0.0168 
0.2832 



AX . |0.8204 p.0297 |4.9506 [0.0963 |2.9916 0.1177 



1,6332 0.0878 



8454 0.1925 



*Gene frequency. 
**Standard error. 



Table 4. Estimated gene frequencies for HLA-B antigens 



Antigen 


CAU 




AFR 


ASI 


LAT 


NAT 


Gf 




Gf 


SE 


Gf 


SE 


Gf 


SE 


Gf 


SE 


B7 


12,1782 


0.0445 


10.5960 


0.1024 


4.2691 


0.0827 


6.4477 


0.0918 


10.9845 


02432 


B8 


9.4077 


0.0397 


3.8315 


0.0634 


1.3322 


0.0467 


3.8225 


0.0715 


8.5789 


02176 


B13 


2.3061 


0.0203 


0.8103 


0.0295 


4.9222 


0.0886 


1.2699 


0.0416 


1.7495 


0.1013 


B14 


4.3481 


0.0277 


3.0331 


0.0566 


0.5004 


0.0287 


5.4166 


0.0846 


2.9823 


0.1316 


B18 


4.7980 


0.0290 


3.2057 


0.0582 


1.1246 


0.0429 


4.2349 


0.0752 


3.3422 


0.1391 


B27 


4.3831 


0.0278 


1.2918 


0.0372 


2.2355 


0.0603 


2.3724 


0.0567 


5.1970 


0.1721 


B35 


9.6614 


0.0402 


8.5172 


0.0927 


8.1203 


0.1122 


14.6516 


0.1329 


10.1198 


02345 


B37 


1.4032 


0.0159 


0.5916 


0.0252 


12327 


0.0449 


0.7807 


0.0327 


0.9755 


0.0759 


B41 


0.9211 


0.0129 


0.8183 


0.0296 


0.1303 


0.0147 


12818 


0.0418 


0.4766 


0.0531 


B42 


0.0608 


0.0033 


5.6991 


0.0768 


0.0841 


0.0118 


0.5866 


0.0284 


02856 


0.0411 


B46 


0,0099 


0.0013 


0.0151 


0.0040 


4.9292 


0.0886 


0.0234 


0.0057 


0.0238 


0.0119 


B47 


0.2069 


0.0061 


0,1305 


0.0119 


0.0956 


0.0126 


0.1832 


0.0159 


02139 


0.0356 
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AntigCD 


CAU 


J „^r; 


AFR 


ASI 


T.AT 


NAT 


Gf 




Gf 


SE 


Gf 


SE 


Gf 


SE 


Gf 


SE 


B48 
B53 
B59 
B67 
B70 
B73 


0.0865 
0.4620 
0.0020 
0.0040 
0.3270 
0.0108 


0.0040 
0.0092 
0.0006 
0.0009 
0.0077 
0.0014 


0.1316 

10.9529 

0.0032 


0.0119 
0.1039 
0.0019 


2.0276 
0.4315 
0.4277 


0.0575 
0.0266 
0.0265 


1.5915 
1.6982 
0.0055 


0.0466 
0.0481 
0.0028 


1.0267 
1.0804 
0° 


0.0778 
0 079ft 


0.0086 
7.3571 
0.0032 


0.0030 
0.0866 
0.0019 


0.2276 
0.8901 
0.0132 


0.0194 
0.0382 
0.0047 


0.0055 
1.9266 
0.0261 


0.0028 
0.0512 
0.0060 


0.0059 
0.6901 

0*" 


0.0059 
0.0639 


B51 
B52 

B5 unsplit 
B5 total 


5.4215 

0.9658 
0.1565 
6.543S 


0.0307 
0.0132 
0.0053 
0.0435 


2.5980 

1.3712 
0.1522 
4.1214 


0.0525 

0.0383 
0.0128 
0.0747 


7.4751 

3.5121 
0.1288 
11.1160 


0.1080 

0.0752 
0.0146 
0.1504 


6.8147 

2.2447 
0.1546 
92141 


0.0943 
0.0552 
0.0146 
0.1324 


6.9077 
0.6960 
0.1307 
7.7344 


0.1968 
0.0641 
0.0278 
02784 


B44 
B45 

B12 unsplit 
B12 total 


13.4838 
0.5771 
0.0788 
14.1440 


0.0465 
0.0102 
0.0038 
0.0474 


7.0137 
4,SQ69 
0.0280 
11.8486 


0.0847 
0.0708 
0.0055 
0.1072 


5.6807 
0.1816 
0.0049 
5.8673 


0.0948 
0.0173 
0.0029 
0.0963 


9.9253 

I. 8812 
0.0193 

II. 8258 


0.1121 
0.0506 
0.0051 
0.1210 


11.8024 

0.7603 

0.0654 


02511 
0.0670 
0.0197 

0 7^0/1 


B62 
B63 
B75 
B76 
B77 

B15 unsplit 
B15 total 


5.9117 
0.4302 
0.0104 
0.0026 
0.0057 
0.1305 
6.4910 


0.0320 
0.0088 
0.0014 
0.0007 


1.5267 
1.8865 
0.0226 
0.0065 


0.0404 
0.0448 
0.0049 
0.0026 


9.2249 
0.4438 
1.9673 
0.0874 


0.1190 
0.0270 
0.0566 
0.0120 


4.1825 
0.8083 
O.IIOI 
0.0055 


0.0747 
0.0333 
0.0123 
0.0028 


6.9421 
0.3738 
0.0356 

0= 

0.0059 
0.0715 
7 il70n 


0.1973 
0.0471 
0.0145 


0.0010 
0.0049 
0.0334 


0.0119 
0.0691 
3.5232 


0.0036 
0.0086 
0.0608 


0.0577 
0.4301 
12.2112 


0.0098 
0.0266 
0.1344 


0.0083 
0.1820 
52967 


0.0034 
0.0158 
0.0835 


0.0059 
0.0206 
02035 


B38 
B39 

B16 unsplit 
BI6 total 


2.4413 
1.9614 
0.0638 

4.4667 


0.0209 
0.0188 
0.0034 

0.0280 


0.3323 
1.2893 
0.0237 

1.6453 


0.0189 
0.0371 
0.0051 
0.0419 


3.2818 
2.0352 
0.0644 
5.3814 


0.0728 
0.0576 
0.0103 
0.0921 


1.9652 
6.3040 
0.1226 
8.3917 


0.0517 
0.0909 
0.0130 
0.1036 


1.1017 
4.5527 
0.0593 

S 71*^7 

J. / XJf 


0.0806 
0.1615 
0.0188 
w. 1 /y / 


B57 
B58 

B17 unsplit 
B17 total 


3.5955 
0.7152 
0.2845 
4.5952 


0.0252 
0.0114 
0.0072 
0.0284 


5.6746 
5.9546 
0.3248 
11.9540 


0.0766 
0.0784 
0.0187 
0.1076 


2.5782 
4.0189 
0.3751 
6.9722 


0.0647 
0.0803 
0.0248 
0.1041 


2.1800 
12481 
0.1446 
3.5727 


0.0544 
0.0413 
0.0141 
0.0691 


2.7265 
0.9398 

0.2674 
3.9338 


0.1260 • 
0.0745 

0.0398 
0.1503 . 


B49 
B50 

B21 unsplit 
B21 total 


1.6452 
1.0580 
0.0702 
2.7733 


0.0172 
0.0138 
0.0036 
0.0222 


2.6286 
0.8636 
0.0270 
3.5192 


0.0528 
0.0304 
0.0054 
0.0608 


02440 
0.4421 
0.0132 
0.6993 


0.0200 
0.0270 
0.0047 
0.0339 


2.3353 
1.8883 
0.0771 
4.3007 


0.0507 
0.0103 
0.0755 


0.7862 
0.0356 
2.3680 


0.0681 
0.0145 
0.1174 


B54 
B55 
B56 

B22 unsnlit 
B22 total 


0.0124 
1.9046 
0.5527 

2.0852 


0.0015 
0.0185 
0.0100 

0.0217 


0.0183 
0.4895 
0.2686 
0.0496 
0.8261 


0.0044 
0.0229 
0.0170 
0.0073 
0.0297 


2.6873 
22444 
0.8260 
0.2730 
6.0307 


0.0660 
0.0604 
0.0368 
0.0212 
0.0971 


0.0289 
0.9515 
0.3596 
0.0372 
1.3771 


0.0063 
0.0361 
0.0222 
0.0071 
0.0433 


1.4054 
0.3387 
0.1246 
1.9221 


v.vl /O 

0.0909 
0.0448 
0.0272 
0.1060 


B60 
B61 

B40 unsplit 
B40 total 


5.2222 
1.1916 
0.2696 
6.6834 


0.0302 
0.0147 
0.0070 
0.0338 


1.5299 
0.4709 
0.0388 
2.0396 


0.0404 
0.0225 
0.0065 
0.0465 


8.3254 
62072 
0.3205 
14.8531 


0.1135 
0.0989 
0.0230 
0.1462 


22538 
4.6691 
02473 
7.1702 


0.0553 
0.0788 
0.0184 
0.0963 


5.7218 
2.6023 
0.2271 
8.5512 


0.1801 
0.1231 
0.0367 
02168 


BX 


1.0922 


0.0252 


3.5258 


0.0802 


3.8749 


0.0988 


2.5266 


0.0807 


1.9867 


0.1634 



•GenefrequeiMqr. 'Standard error. The observed gene count was zero. 
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Table 5. Estimated gene frequencies of HLA-DR antigens 



Antigen 


CA 


lU 


AFR 


ASI 


LAT 


NAT 


Gf 




Gf 


SE 


Gf 


SE 


Gf 




VJX 




DRl 


10.2279 


0.0413 


6 8200 


0,0832 


3.4628 


0 0747 




1 Al 'J 




0.2139 


DR2 


15.2408 


0.0491 


16.2373 


0.1222 


18 6162 


0 ^f^0R 

Vrl wo 




n n R7 




n oo 1 o 


DR3 


10.8708 


0.0424 


13.3080 


0.1124 


4.7223 


0.0867 


1 8998 


0 inn8 






DR4 


16.7589 


0.0511 


5.7084 


0.0765 


15.4623 


0.1490 


20.5373 


0.1520 


19.8264 


0.3123 


DR6 


14.3937 


0.0479 


18.6117 


0.1291 


13.4471 


0.1404 


17.0265 


0.1411 


14.8021 


0.2772 


JU/Xv / 




U.U4oi 


10.1317 


0.0997 


6.9270 


0.1040 


10.6726 


0.1155 


10.4219 


0.2378 


DR8 


2.8820 


0.0227 


6.2673 


0.0800 


6.5413 


0.1013 


9.7731 


0.1110 


6.0059 


0.1844 


DR9 


1.0616 


0.0139 


2.9646 


0.0559 


9.7527 


0.1218 


1.0712 


0.0383 


2.8662 


0.1291 


DRIO 


1.4790 


0.0163 


2.0397 


0.0465 


2.2304 


0.0602 


1.8044 


0.0495 


1.0896 


0.0801 


DRll 


9.3180 


0.0396 


10.6151 


0.1018 


4.7375 


0.0869 


7.0411 


0.0955 


5.3152 


0.1740 


DR12 


1.9070 


0.0185 


4.1152 


0.0655 


10.1365 


0.1239 


1.7244 


0.0484 


2.0132 


0.1086 


DR5unsplit 


1.2199 


0.0149 


2.2957 


0.0493 


1.4118 


0.0480 


1.8225 


0.0498 


1.6769 


0.0992 


DR5 total 


12.4449 


0.0045 


17.0260 


0.1243 


16.2858 


0.1516 


10.5880 


0.1148 


9.0052 


0.2218 


DRX 


1J598 


0.0342 


0.8853 


0.0760 


2.5521 


0.1089 


1.4023 


0.0930 


2.0834 


0J2037 



'Gene frequency. 
5 ''Standard error. 



Tables 3, 4, and 5 derived from HLA Gene and Haplotype Frequencies in the North 
American Population: The National Marrow Donor Program Donor Registry, Mori, M. et al. 
Determining Whet her a Fragment with MHC Affinity is a Useful Epitope 

10 As discussed above, a preliminary step of the disclosed method is to select from among the 

original population of peptide fragments a subpopulation of peptides with an actual or predicted 
MHC affinity. The selected fragments are analyzed ftirther to determine which can be produced by 
a cell under in vivo conditions that could result in binding of the peptide to the selected MHC 
aUele. All peptides that meet both criteria of MHC affinity and correct proteolytic processing are 

15 designated as "discovered epitopes.** A variety of methods are available for determining which 
peptide fragments can be produced by proteolytic processing in vivo. These methods include 
elution of peptides from solubilized MHC and intact cells, computer sequence analysis of the 
proteolytic cleavage motifs, and in vitro analysis of actual peptide fragments produced by cellular 
proteolytic machinery. 

20 In a preferred embodiment, a series of synthetic peptides centrally contaming either 

individual or clustered candidate peptide sequences can be generated. Such peptides typically 
range in length from about 10 to about 75 amino acids. In a preferred embodiment, the synthetic 
peptide is between about 20 and 60 amino acids in length. In a more preferred embodiment, the 
cluster is between about 30 and 40 ammo acids in length. Using standard peptide synthesis 

25 chemistry, including t-Boc protection chemistry, Fmoc protection chemistry, and the like, one of 
ordinary skill in art the can produce a population of candidate peptides for subsequent screening. 

Alternatively, peptide fragments containing candidate peptides can be generated in vitro 
through protease digestion or chemical cleavage of the TAA or fragments thereof. Protease 
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digestion to prepare such fragments of TAAs can employ a wide variety of known proteases, 
including but not limited to proteasome proteases, trypsin, -chymotiypsin, bromelain, clostripain, 
elastase, endoproteinases, exoproteinases, proteinase K, ficin, papain, pepsin, plasmin, 
thennolysin, tiirombin, Irypsin, cathepsins, and ofliers. Chemical methods can also be used to 
generate peptide candidates. Suitable chemicals or chemical reactions for cleaving peptide bonds 
include mild acid cleavage, cyanogen bromide, hydroxylamine, iodosobenzoic acid, 2-Nitro-5- 
thiocyanobenzoate, and the like. In one embodiment, the unfragmented TAA can be used, 
although the use of a particularly large initial sequence can complicate the analysis. 

Regardless of how flie fragments containing candidate peptides are created, determining 
which epitopes are produced by the cellular machinery is important. In one embodiment of the 
invention, proteasome digestion is used to estimate cellular epitope generation. In this 
embodunent, immune and housekeepmg proteasomes are purified for m vitro use m order to assess 
the antigenic repertofre generated naturally from the two kinds of proteasomes. 

Generally, proteasomes are prepared by affmity purification from cell extracts. In a 
15 preferred embodiment, a cell lysate is prepared using standard techniques. The lysate is cleared by 
ultracentrifugation if erythrocytes are not the original source material. The prepared cell lysate is 
then purified firan other cellular components usmg any one of a number of purification techniques 
including various forms of chromatography. 

In one embodiment afBnity chromatography is used to purify the proteasomes. The cell 
20 lysate is applied to an affinity column containing a monoclonal antibody (mAb) against one of the 
proteasomal subunits. The column is then washed to purify the bound proteasomes from other 
ceUular material. Following washing, the bound proteasomes are then eluted from the column. 
The eluate is characterized m terms of protem content and proteolytic activity on a standard 
substrate. 

25 Cleavage analysis using both housekeeping and unmune proteasomes yields class I 

epitopes from various TAA. The epitopes that are presented by pAPCs correspond to cleavage 
products of the immune proteasome, while the epitopes presented by tumors and by many cells 
chronicaUy infected wilh intracellular parasites correspond to cleavage products of the 
housekeeping proteasome. Once the digest is performed, the particular molecular species produced 

30 are identified. In a preferred embodiment, this is accomplished by mass spectrometry. This allows 
the rapid identification of natural peptide fragments that are produced by either of the two kinds of 
proteasomes. In anoflier embodiment, cleavage of the target antigen or fragments thereof by 
immune and housekeeping proteasomes, or by endosomal/lysosomal proteases (see below), is 
predicted by computer modeling based on cleavage motife of the relevant proteolytic activities. 
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Whereas class I A4HC is loaded primarily with proteasomally derived peptides as it 
initially folds fai the endoplasmic reticulum, flie binding cleft of class II MHC is blocked by the so- 
called invariant chain (li) in this compartment Loadmg of peptide for class n MHC takes place 
primarily m the endosomal compartment, utilizing peptides generated by endosomal and lysosomal 
5 proteases. Thus if in vitro identification of MHC class II epitopes is desired, preparations of 
proteases fsom endosomal and/or lysosomal fractions can be substituted for the proteasomes. A 
variety of methods to accomplish this substitution are described in the literature. For example, 
Kido & Ohshita, Anal. Biochem., 230:41-7 (1995); Yamada, et al., J. Biochem. (Tokyo), 95:1 155- 
60 (1984); Kawashima, et al. Kidney Int., 54:275-8 (1998); Nakabayshi & Ikezawa, Biochem. Int. 
10 16:1119-25 (1988); Kanaseki & Ohkuma, J. Biochem. (Tokyo), 110:541-7 (1991); Wattiaux, et al., 
J. CeU BioL, 78:349-68 (1978); Lisman, et al., Biochem. J. 178:79-87 (1979); Dean, B., Arch. 
Biochem. Biophys., 227:154-63 (1983); Overdijk, et al.. Adv. Exp. Med. Biol., 101:601-10 (1978); 
Stromhaug, et al., Biochem. J., Biochem. J., 335:217-24 (1998); Escola, et al., J. Biol. Chem. 
271:27360-5 (1996); Hammond, et al.. Am. J. Physiol., 267:F516.27 (1994); Williams & Smith, 
15 Arch. Biochem. Biophys. 305:298-306 (1993); Marsh, M., Methods CeU Biol., 31:319-34 (1989); 
and Schmid & and Melhnan, Prog. Clin. Biol. Res., 270:35-49 (1988) all disclose methods to 
prepare suitable proteolytic preparations. 

In another embodiment, the digestion to determine which epitopes the cellular machinery 
produces, takes place vwthm a cell expressing the TAA or a fragment thereof For class I epitopes 
20 it is preferred that the type of proteasome expressed by tiie cell be detennined, for example, by 
western blotting. Hie MHC epitopes produced can then be eluted from either solubilized and 
purified MHC as described m Falk, K. et al. Nature 351:290, 1991, or directly from the intact cell 
as described in U.S. Patent 5,989,565. Eluted fragments are then identified by mass spectrometry. 
Analysis of Target Protein Fragments 
25 The molecular species detected by mass spectrometry are compared with the candidate 

peptides predicted above. For the case of class I epitopes, species that are as long as, or longer 
than, a candidate peptide and share its C-terminus are desired; N-terminal trimming of at least up 
to 25 ammo acids can occur independently of the proteasome (Craiu, A. et al. Proc. Natl. Acad. 
Sci. USA 94:10850-55, 1997). Class H MHC is highly tolerant in tenns of the length of the 
30 peptides it will bmd, so the absence of cleavage in the middle of tiie epitope becomes the primary 
criterion, mther than generation of a correct end. 

A selected digestion product is then synthesized and used as a standard in an analytic 
method such as HPLC versus an aliquot of the digest This provides a further check on the identity 
of the digestion product and allows its yield to be determined. In rare cases more than one 
35 potential product may have similar enough masses and chemical characteristics that they may not 
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be reliably diflFerentiated by these methods. In such cases the HPLC peak can be collected and 
subjected to direct sequencmg to confum identity. 
Analysis of Peptides for MHC Binding 

The epitope is synthesized and tested for its ability to bind a MHC receptor. For example, 
5 in one preferred assay, cells displaying the MHC I receptor can be used to measure the binding 
affinity of candidate peptides labeled with a radionuclide. Another preferred approach measures 
tiie ability of a peptide to bmd to an MHC I receptor using a cell culture-based assay. In this assay, 
cells lacking transporters associated with antigen processing (TAP) are used to detennine whether 
or not a candidate peptide has the ability to bind to the MHC I receptor. TAP' cells have the 

10 phenotype in which class I MHC proteins do not always fold properly, and surface expression of 
MHC I is thus reduced or abolished. When the cell is flooded with exogenous peptide that can 
bind to the MHC I cleft, expression of the receptor is restored. This can be monitored by several 
means such as RIA, FACS, and the like. Using TAP" cells, one of skill in the art can screen large 
numbers of potential candidate peptides for receptor bmding without having to perfoim detailed 

15 binding afBnity analysis. 

The analysis methods of the various embodiments of the invention are useful in examining 
candidate peptides generated in a variety of ways. For example, the described analysis can be used 
in evaluating multiple candidate peptides generated through m vitro methods or by computational 
analysis, to identify those candidate sequences that have MHC receptor binding characteristics. 

20 Preferred candidate peptides in this embodiment of the invention are those that are already known 
to be products of proteolytic production by housekeeping and/or immune proteasomes. Both in 
vivo cleavage products and in vitro cleavage products tiiat are shown or predicted to bmd to MHC 
are properly designated as **discovered epitopes.'' Epitope clusters for use in connection witii this 
invention are disclosed herein. 

25 ECRs are Processed into MHC-Bindme Epitopes in pAPCs 

The inunune system constantiy surveys the body for the presence of foreign antigens, m part 
through the activity of pAPCs. The pAPCs endocytose matter found in the extracellular milieu, 
process that matter from a polypeptide form mto shorter oligopeptides of about 3 to 23 ammo acids m 
length, and display some of the resultingpeptides to T cells via the MHC complex of flie pAPCs. For 

30 example, a tumor cell upon lysis releases its cellular contents, includmg various proteins, into the 
extracellular milieu. Those released proteins can be endocytosed by pAPCs and processed into 
discrete peptides that are tiien displayed on the surface of tiie pAPCs via the MHC. By this 
mechanism, it is not the entire target protein that is presented on the surface of the pAPCs, but rather 
only one or more discrete fragments of that protein that are presented as MHC-bindmg epitopes. If a 

35 presented epitope is recognized by a T cell, tiiat T cell is activated and an immune i^ponse results. 
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Similarly, the scavenger receptors on pAPC can take-up naked nucleic acid sequences or 
recombinant organisms containing target nucleic acid sequences. Uptake of the nucleic acid 
sequences into the pAPC subsequently results in the expression of the encoded products. As above, 
when an ECR can be processed into one or more useful epitopes, these products can be presented as 
5 MHC epitopes for recognition by T cells. 

MHC-binding epitopes are often distributed unevenly throughout a protein sequence in 
clusters. Embodiments of the mvention are directed to identifying epitope cluster regions (ECRs) 
in a particular region of a target protein. Candidate ECRs are likely to be natural substrates for 
various proteolytic enzymes and are likely to be processed into one or more epitopes for MHC 
10 display on the surfece of an pAPC. In contrast to more traditional vaccines that deliver whole 
protems or biological agents, ECRs can be administered as vaccines, resulting in a high probability 
that at least one epitope will be presented on MHC without requiring the use of a fiiU length 
sequence. 

The Use of EC Rs in Identifying Discrete MHC-Binding Epitopes 

15 Identifying putative MHC epitopes for use in vaccmes often includes the use of available 

predictive algorithms that analyze the sequences of proteins or genes to predict binding affinity of 
peptide fragments for MHC. These algorithms rank putative epitopes according to predicted 
af35nity or other characteristics associated with MHC binding. Exemplary algorithms for this kmd 
of analysis include the Rammensee and NIH (Parker) algorithms. However, identifymg epitopes 

20 that are naturally present on the surface of cells from among putative epitopes predicted using 
these algorithms has proven to be a difficuh and laborious process. The use of ECRs in an epitope 
identification process can enormously simpUfy the task of identifying discrete MHC bmdmg 
epitopes. 

In a preferred embodiment, ECR polypeptides are synthesized on an automated peptide 
25 synthesizer and these ECRs are then subjected to in vitro digests using proteolytic enzymes 
involved in processing proteins for presentation of the epitopes. Mass spectrometry and/or 
analytical HPLC are then used to identify the digest products and in vitro MHC bmding studies are 
used to assess the ability of these products to actually bind to MHC. Once epitopes contained in 
ECRs have been shown to bmd MHC, they can be incorporated into vaccines or used as 
30 diagnostics, either as discrete epitopes or in the context of ECRs. 

The use of an ECR (which because of its relatively short sequence can be produced 
through chemical synthesis) in this preferred embodiment is a significant improvement over what 
otherwise would require the use of whole protein. This is because whole proteins have to be 
produced usmg recombinant expression vector systems and/or complex purification procedures. 
35 The simplicity of using chemically synthesized ECRs enables the analysis and identification of 
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large numbers of epitopes, while greatly reducing the time and expense of the process as compared 
to other currently used methods. The use of a defined ECR also greatly simplifies mass spectrum 
analysis of the digest, since the products of an BCR digest are a small fiaction of the digest 
products of a whole protein. 
5 In another embodiment, nucleic acid sequences encoding ECRs are used to express the 

polypeptides in cells or cell lines to assess which epitopes are presented on the surface. A variety 
of means can be used to detect the epitope on flie surface. Preferred embodiments mvolve the lysis 
of the cells and afSnity purification of the MHC, and subsequent elution and analysis of peptides 
fit>m the MHC; or elution of epitopes from intact cells; (Falk, K. et al. Nature 351:290, 1991, and 
10 U.S. Patent 5,989,565, respectively). A sensitive method for analyzing peptides eluted in this way 
from the MHC employs capillary or nanocapillary HPLC ESI mass spectrometry and on-line 
sequencing. 

Target- Associated Antigens that Contain ECRs 

TAAs from which ECRs may be defined include those from TuAAs, including oncofetal, 

15 cancer-^estis, deregulated genes, fiision genes from errant translocations, differentiation antigens, 
embryonic antigens, cell cycle proteins, mutated tumor suppressor genes, and overexpressed gene 
products, including oncogenes. In addition, ECRs may be derived from virus gene products,, 
particularly those associated with viruses that cause chronic diseases or are oncogenic, such as the : 
herpes viruses, human p2q>illoma viruses, human immunodeficiency virus, and human T cell 

20 leukemia virus. Also ECRs may be derived from gene products of parasitic organisms, such as 
Trypanosoma^ Leishmanial and other intracellular or parasitic organisms. 

Some of these TuAA include a-fetoprotein, carcinoembryonic antigen (CEA), esophageal 
cancer derived NY-ESO-1, and SSX genes, SCP-1, PRAME, MART-l/MelanA (MART-1), gplOO 
(Pmel 17), tyrosmase, TRP-l, TRP-2, MAGE-1, MAGE-2, MAGE-3, BAGE, GAGE-l, GAGE-2, 

25 pi 5; overexpressed oncogenes and mutated tumor-suppressor genes such as p53, Ras, HER-2/neu; 
unique tumor antigens resulting from chromosomal translocations such as BCR-ABL, E2A-PRL, 
H4-RET, IGH-IGK, MYL-RARl and viral antigens, EBNAl, EBNA2, HPV-E6, -E?; prostate 
specific antigen (PSA), prostate stem cell antigen (PSCA), MAAT-1, GP-lOO, TSP-180, MAGE-4, 
MAGE-S, MAGE-6, RAGE, pl85erbB-2, pl85erbB-3, c-met, nm-23Hl, TAG-72, CA 19-9, CA 

30 72-4, CAM 17.1, NuMa, K-ras, -Catenin, CDK4, Mum-1, pl5, and pl6. 

Numerous other TAAs are also contemplated for both pathogens and tumors. In terms of 
TuAAs, a variety of methods are available and well known in the art to identify genes and gene 
products that are differentially expressed in neoplastic cells as compared to normal cells. 
Examples of these techniques include differential hybridization, including the use of microarrays; 

35 subtractive hybridization cloning; differential display, eidier at the level of mRNA or protein 
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expression; EST sequencing; and SAGE (sequential analysis of gene e3q)ression). These nucleic 
acid techniques have been reviewed by CaniUi, JJ>. et al., J. Cellular Biochem Suppl. 30/31:286- 
296, 1998. Differential display of proteins involves, for example, comparison of two-dimensional 
poly-aciylamide gel electrophoresis of cell lysates fix^m tumor and normal tissue, location of 
5 protein spots unique or overe5q)ressed in the tumor, recovery of the protein from the gel, and 
identification of the protein using traditional biochemical- or mass spectrometiy-based sequencing. 
An additional technique for identification of TAAs is the Serex technique, discussed m Tiireci, O., 
Sahni, U., and Pfreundschuh, M., "Serological analysis of human tumor antigens: molecular 
definition and implications". Molecular Medicine Today, 3:342, 1997. 
10 Use of fliese and other methods provides one of skill m the art the techniques necessary to 

identify genes and gene products contained within a taiget cell that may be used as potential 
candidate proteins for generating the epitopes of the mvention disclosed. However, it is not 
necessary, in practicing the mvention, to identify a novel TuAA or TAA. Rafter, embodiments of 
the mvention make it possible to identify ECRs from any relevant protein sequence, whetiier the 
IS sequence is afready known or is new. 

Protein Sequence Analysis to Identify Epitope Clusters 

In preferred embodiments of tiie invention, identification of ECRs involves two main 
steps: (1) identifying good putative epitopes; and (2) defining tiie limits of any clusters in which 
these putative epitopes are located. There are various preferred embodunents of each of tfiese two 
steps, and a selected embodiment for the first step can be freely combined with a selected 
embodunent for the second step. The methods and embodiments that are disclosed herein for each 
of tiiese steps are merely exemplary, and are not intended to limit the scope of the mvention in any 
way. Persons of skill in the art mil appreciate the specific tools tiiat can be applied to flie analysis 
of a specific TAA, and such analysis can be conducted in numerous ways in accordance with the 
25 invention. 

Preferred embodiments for identifying good putative epitopes include the use of any 
available predictive algorithm that analyzes the sequences of proteins or genes to predict bindmg 
afSnify of peptide fragments for MHO, or to rank putative epitopes according to predicted affinity 
or other characteristics associated witii MHC binding. As described above, available exemplary 

30 algoritimis for tiiis kind of analysis include tiie Rammensee and NIH (Parker) algorithms. 
Likewise, good putative epitopes can be identified by direct or indirect assays of MHC binding. 
To choose **good" putative epitopes, it is necessary to set a cutoff point in terms of the score 
reported by the prediction software or in terms of the assayed bindmg affinity. In some 
embodiments, such a cutoff is absolute. For example, the cutoff can be based on tiie measured or 

35 predicted half tune of dissociation between an epitope and a selected MHC allele. In such cases. 
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embodiments of the cutoflF can be any half time of dissociation longer than, for example, 0,5 
minutes; in a preferred embodhnent longer than 2.5 minutes; in a more preferred embodiment 
longer than 5 minutes; and in a highly stringent embodiment can be longer tiian 10, or 20, or 25 
minutes. In these embodiments, the good putative epitopes are those that are predicted or 
5 identified to have good MHC binding characteristics, defined as being on the desirable side of the 
designated cutoflF point. Likewise, the cutoff can be based on the measured or predicted binding 
afBnity between an epitope and a selected MHC allele. Additionally, the absolute cutoff can be 
simply a selected number of putative epitopes. 

In other embodiments, the cutoff is relative. For example, a selected percentage of the 

10 total number of putative epitopes can be used to establish the cutoff for defining a candidate 
sequence as a good putative epitope. Again the properties for ranking the epitopes are derived 
fi*om measured or predicted MHC binding; the property used for such a determination can be any 
that is relevant to or indicative of binding. In preferred embodiments, identification of good 
putative epitopes can combine multiple methods of ranking candidate sequences. In such 

15 embodiments, the good epitopes are typically those that either represent a consensus of the good 
epitopes based on different methods and parameters, or that are particularly highly ranked by at 
least one of the methods. 

When several good putative epitopes have been identified, their positions relative to each 
other can be analyzed to determine the optimal clusters for use in vaccines or in vaccine design. 

20 Hiis analysis is based on the density of a selected epitope characteristic within the sequence of the 
TAA. Hie regions with the highest density of the characteristic, or with a density above a certain 
selected cutoff^ are designated as ECRs. Various embodunents of the invention employ different 
chamcteristics for the density analysis. For example, one preferred characteristic is simply the 
presence of any good putative epitope (as defined by any appropriate method). In this 

25 embodhnent, all putative epitopes above the cutoff are treated equally in the density analysis, and 
the best clusters are those with the highest density of good putative epitopes per amino acid 
residue. In another embodiment, the preferred characteristic is based on the parameter(s) 
previously used to score or rank the putative epitopes. In this embodiment, a putative epitope with 
a score that is twice as high as another putative epitope is doubly weighted in the density analysis, 

30 relative to the other putative epitope. Still other embodiments take the score or rank into account, 
but on a diminished scale, such as, for example, by using the log or the square root of the score to 
give more weight to some putative epitopes than to others in the density analysis. 

Depending on the length of the TAA to be analyzed, the number of possible candidate 
epitopes, the number of good putative epitopes, the variability of the scoring of the good putative 

35 epitopes, and other factors that become evident m any given analysis, the various embodiments of 
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the inventioa can be used alone or in combination to identify tbose ECRs that are most useful for a 
given application. Iterative or parallel analyses employing multiple approaches can be beneficial 
in many cases. ECRs are tools for increased efficiency of identifying true MHC epitopes, and for 
eflBcient "packaging** of MHC epitopes into vaccines. Accordingly, any of the embodiments 
5 described herefai, or other embodiments that are evident to those of skill in the art based on this 
disclosure, are useful in enhancing flie efficiency of these efforts by using ECRs mstead of using 
complete TAAs in vaccines and vaccine design. 

Since many or most TAAs have regions with low density of predicted MHC epitopes, 
using ECRs provides a valuable methodology that avoids the inefficiencies of including regions of 
1 0 low epitope density in vaccines and in epitope identification protocols. Thus, useful ECRs can also 
be defined as any portion of a TAA that is not the whole TAA, wherein the portion has a higher 
density of putative epitopes tiian the whole TAA, or than any regions of the TAA that have a 
particularly low density of putative epitopes. In this aspect of the invention, therefore, an ECR can 
be any fragment of a TAA with elevated epitope density. In some embodiments, an ECR can 
15 mclude aregion up to about 80% of the length of the TAA. In a preferred embodiment, an ECR 
can include a region up to about 50% of the length of the TAA. In a more preferred embodhnent, 
an ECR can include a region up to about 30 % of the length of the TAA. And in a most preferred . 
embodhnent, an ECR can mclude a region of between 5 and 15% of the length of the TAA. 

In another aspect of the mvention, the ECR can be defmed in terms of its absolute length. 

20 Accordmgly, by this definition, ±e minimal cluster for 9-mer epitopes mcludes 10 amino acid 
residues and has two overlapping 9-mers with 8 ammo acids in common. In a preferred 
embodiment, the cluster is between about 15 and 75 amino acids in length. In a more preferred 
embodiment, the cluster is between about 20 and 60 amino acids in length. In a most preferred 
embodhnent, the cluster is between about 30 and 40 amino acids in length. 

25 In practice, as described above, ECR identification can employ a simple density function 

such as the number of epitopes divided by the number of amino acids spanned by the those 
epitopes. It is not necessarily required that the epitopes overlap, but the value for a single epitope 
is not significant. If only a smgle value for a percentage cutoff is used and an absolute cutojff m the 
epitope prediction is not used, it is possible to set a single threshold at this step to define a cluster. 

30 However, usmg both an absolute cutoff and carrying out the first step using different percentage 
cutoffs, can produce variations m the global density of candidate epitopes. Such variations can 
require further accounting or manipulation. For example, an overlap of 2 epitopes is more 
significant if only 3 candidate epitopes were considered, tfian if 30 candidates were considered for 
any particular length prpteki. To take this feature mto consideration, the weight given to a 

35 particular cluster can further be divided by the fraction of possible peptides actually bemg 
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considered, in order to increase the significance of the calculation. This scales the result to the 
average density of predicted epitopes in the parent protein. 

Shnilariy, some embodiments base the scoring of good putative epitopes on the average 
number of peptides considered per amino acid in the protein. The resulting ratio represents the 
5 factor by which the density of predicted epitopes in the putative cluster differs from the avemge 
density in the protein. Accordmgly, an ECR is defined in one embodiment as any region 
containing two or more predicted epitopes for which this ratio exceeds 2, that is, any region with 
twice the average density of epitopes. In other embodiments, the region is defined as an ECR if the 
ratio exiceeds 1 .5, 3, 4, or 5, or more. 

10 Considering the average number of peptides per amino acid in a target protein to calculate 

the presence of an ECR highlights densely populated ECRs without regard to the score/affmity of 
the individual constituents. This is most appropriate for use of score-based cutoffs. However, an 
ECR with only a small number of highly ranked candidates can be of more biological significance 
than a cluster with several densely packed but lower ranking candidates, particularly if only a small 

15 percentage of the total number of candidate peptides were designated as good putative epitopes. 
Thus in some embodhnents it is appropriate to take into consideration the scores of the individual 
peptides. This is most readily accomplished by substituting the sum of the scores of the peptides in 
the putative cluster for tiie number of peptides in the putative cluster in the calculation described . 
above. 

20 This sum of scores method is more sensitive to sparsely populated clusters containing high 

scoring epitopes. Because the wide range of scores (i.e. half times of dissociation) produced by the 
BIMAS-NDH/Paiker algorithm can lead to a single high scoring peptide dwarfing the contribution 
of other potential epitopes, the log of the score rather than the score itself is preferably used in this 
procedure. 

25 Various other calculations can be devised under one or another condition. Generally 

speaking, the epitope density function is constructed so that it is proportional to the number of 
predicted epitopes, their scores, their ranks, and the like, within the putative cluster, and inversely 
proportional to the number of ammo acids or flection of protein contained within that putative 
cluster. Alternatively, the function can be evaluated for a window of a selected number of 

30 contiguous amino acids. In either case the function is also evaluated for aU predicted epitopes in 
the whole protein. If the ratio of values for the putative cluster (or window) and the whole protein 
is greater than, for example, 1.5, 2, 3, 4, 5, or more, an ECR is defined. 
Analvsis of Target Gene Products For MHC Bmdmg 

Once a TAA has been identified, the protein sequence can be used to identify putative 

35 epitopes with known or predicted afiSnity to the MHC peptide binding cleft Tests of peptide 
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fragments can be conducted in vitro, or using the sequence can be computer analyzed to determine 
MHC receptor binding of the peptide fragments. In one embodfanent of the invention, peptide 
fragments based on the amino acid sequence of the target protein are analyzed for their predicted 
ability to bind to the MHC peptide binding cleft. Examples of suitable computer algorithms for this 
5 purpose include the Rammensee/SYFPEITHI and the NIH (Parker) sites referenced in the 
discussion of epitope discovery above. 

As an alternative to predictive algorithms, a number of standard in vitro receptor binding 
affinity assays are available to identify peptides having an affmity for a particular allele of MHC. 
Accordmgly, by the method of this aspect of the invention, the initial population of peptide 

10 fragments can be narrowed to include only putative epitopes haviug an actual or predicted affinity 
for the selected allele of MHC, Selected common alleles of MHC, and their approximate 
frequencies, are reported in the tables 3-5 above. 

It has been observed that predicted epitopes often cluster at one or more particular regions 
within the amino acid sequence of a TAA. The identification of such ECRs offers a simple and 

15 practicable solution to the problem of designing effective vaccines for stimulating cellular 
immunity. For vaccines in which immune epitopes are desired, an ECR is directly usefiil as a 
vacciue. This is because the unmune proteasomes of the pAPCs can correctly process the cluster, 
liberating one or more of the contained MHC-binding peptides, in the same way a cell having v 
immune proteasomes processes and presents peptides derived from the complete TAA. The cluster 

20 is also a useful a starting material for identification of housekeeping epitopes produced by the 
housekeeping proteasomes active in peripheral cells. 
Additional Considerations for Vaccine Design 

There exist numerous alleles of MHC I in the human population. Thus, in a preferred 
embodiment, vaccine design can take into account the MHC I genotype of the patient, so as to 

25 deliver epitopes having suitable binding affinities for a particular patient's MHC allele(s). Since a 
patient may be homozygous or heterozygous for the relevant locus, in some embodiments of the 
invention, epitopes optimal for a single MHC I allele are preferred, while m other embodiments, 
epitopes corresponding to different MHC alleles may be preferred. A partial list of major class I 
MHC types, each generally encoded by multiple alleles, and their approximate frequencies, are 

30 reported in Table 6. 
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HLAtype 


Freqnenqr jn mixed-race population 


Al 


30% 


A2 


47% 


A3 


23% 


All 


15% 


A24 


14% 


A29 


7% 


A26 


6% 


B7 


22% 


B8 


21% 


B14 


9% 


B18 


10% 


B27 


8% 


B35 


20% 


B44 


26% 


B62 


14% 


B60 


10% 


B51 


10% 



In yet another embodiment of the present invention, the pAPCs are provided with a 
5 housekeeping epitope and an epitope cluster. The epitope cluster is a peptide or nucleic acid 
sequence that contains or encckies at least two sequences having a known or predicted aflBnity to 
MHC L While it is preferable that Ihe housekeeping epitope be provided to the pAPCs in a state 
that is fuUy processed or as a precursor that is engineered in such a way so fliat it can be processed 
in the pAPC to be an effective housekeeping epitope, the immune epitope can be processed from a 

10 larger precursor by the pAPCs. This is because the immune proteasome is constitutively active in 
the pAPC, and is fiiilly competent to process an appropriate precursor of presumably any lengfii 
into a "correct** immune epitope. 

Potential epitopes are commonly but not always found m clusters in discrete segments of a 
TAA containing multiple epitopes for the purpose of providing an immune epitope. Simply 

15 providing the pAPC with a polypeptide containing a cluster of potential epitopes, or a nucleic acid 
encoding a cluster, or a recombinant organism expressing the cluster enables the pAPC to produce 
at least one appropriate immune epitope. Since epitope clusters generally contain potential 
epitopes for more than one class I MHC allele, in many embodiments a single cluster can be used 
to produce immune epitopes useful with more than one class I MHC allele, 

20 In a preferred embodiment, a patient is inoculated with a vaccine that includes 

housekeeping epitopes derived jfrom a selected TAA. The housekeeping epitope can be a 
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polypeptide or a nucleic acid encoding a polypeptide, or a recombinant organism engineered to 
express the discrete epitope. Beyond this '^niinimal** vaccine containing a housekeeping epitope 
(whether as a polypeptide, a nucleic acid, or recombinant organism), embodiments of the invention 
include vaccmes that additionally have one or more other housekeeping epitopes, or one or more 
5 itnmune epitopes, or any combmation tiiereof. Such epitopes can be derived from the same TAA, 
or they can be derived from different TAAs. 

A preferred embodiment of the present invention includes a method of administering a 
vaccine including a housekeeping epitope to induce a th^peutic immune response. The vaccine 
is administered to a patient in a manner consistent with the standard vaccme delivery protocols that 

10 are well known m the art. Methods of administering epitopes of TAAs include, without lunitation, 
transdermal, intranodal, perinodal, oral, intravenous, mtradermal, intramuscular, intraperitoneal, 
and mucosal admitiistration. A particularly usefril mediod of vaccine delivery to elicit a CTL 
response is disclosed in PCT Publication No. WO 99/01283, entitled "A METHOD OF 
INDUCING A CTL RESPONSE," filed on July 10, 1998. 

15 Because the epitope synchronization system has utility in inducing a cell-mediated 

inunune response, a vaccine to induce a specific T cell response to a target cell is likewise included 
in a preferred embodunent of the present invention. The vaccine contains a housekeeping epitope 
in a concentration effective to cause a pAPC or populations of pAPCs to display housekeeping 
epitopes. Advantageously, the vaccme can include a plurality of housekeeping epitopes or one or 

20 more housekeeping epitopes optionally in combination with one or more immune epitopes. 
Formulations of the vaccine contain peptides and/or nucleic acids in a concentration sufficient to 
cause pAPCs to present the epitopes. The formulations preferably contam epitopes in a total 
concentration of about l^ig-lmg/lOOjil of vaccine preparation. Conventional dosages and dosing 
for peptide vaccines and/or nucleic acid vaccmes can be used with the present invention, and such 

25 dosing regimens are well understood m the art In one embodiment, a single dosage for an adult 
human may advantageously be firom about 1 to about 5000 ^l of such a composition, administered 
one time or multiple times, e.g., in 2, 3, 4 or more dosages separated by I week, 2 weeks, 1 month, 
or more. In a particularly preferred embodiment, such a composition is administered continuously, 
directly into a lymph node, through the use of an insulin pump, at a rate of at least 1 \xl per hour 

30 over several days. Such administration can be repeated periodically to maintain the CTL response 
as is more fully described in PCT Publication No. WO 99/01283. 

The compositions and methods of the invention disclosed herein further contemplate 
incorporating adjuvants into the formulations in order to enhance tiie performance of the vaccines. 
Specifically, the addition of adjuvants to the formulations is designed to enhance the delivery or 

35 uptake of the epitopes by the pAPCs. The adjuvants contemplated by the present invention are 
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known by those of skill in the art and include, for example, GMCSF, GCSF, IL-2, IL-12, BCG, 
tetanus toxoid, and osteoponttn/ETA-L 

In a further embodiment, housekeeping epitope-reactive T cells can be administered to a 
patient as an adoptive immunotherapy. Such T cells can be most readily obtained by in vitro 
5 unmunization, usmg cells from a naive donor, though using the patient as donor can also be 
feasible. Techniques for m vitro immunization are known in the field, for example, Stauss et al, 
Proc, Natl. Acad. Set USA 89:7871-7875, 1992; SalgaUer et al. Cancer Res. 55:4972-4979, 1995; 
Tsai et al., J. Immunol 158:1796-1802, 1997; and Chung et al., J. Immunother. 22:279-287, 1999. 
Use of unmunized donors, or patients themselves, as initial sources of T cells is also contemplated. 
10 (See for example Oelke, M. et al. Clin Cancer Res. 6:1997-2005, 2000; Gervois, N, et al. Clin 
Cancer Res. 6:1459-1467, 2000; Vahnori, D. et al. Cancer Res. 59:2167-3173, 1999; Tsai, V. et al. 
Crit Rev. Immunol. 18:65-75, 1998; Matsunaga, K. et al. Jpn. J. Cancer Res. 90:1007-1015, 1999; 
van Elsas, A. et al. Eur J. Immunol. 26:1683-1689, 1996; and Alters, S.E. et al. Adv. Exp. Med 
Biol. 417:519-524, 1997). Once generated, sufficient numbers of such T cells can be obtained by 

15 expansion m vitro through stimulation witii the vaccines of this invention and/or cytokines (see for 
example Kurokawa, T. et al., Int X Cancer 91:749-746, 2001). These T cells can constitute a clone 
or a polyclonal population recognizing one or more epitopes. Typically, on the order of 10^ to 10^ 
cells are transferred m mice and 10 to 10 m humans. (See for example Drobyski, W.R. et al. 
97:2506-2513, 2001; Seeley B.M. et al. Otolaryngol Head Neck Surg. 124:436-441, 2001; 

20 Kanwar, J.R. et al. Cancer Res. 61:1948-1956, 2001; Plautz, GB. et al. Clin. Cancer Res. 6:2209- 
2218, 2000; Plautz, G.E. et al., /. Neurosurg. 89:42-51, 1998; and Plautz, G.E. et al.. Urology 
54:617-623, 1999). Clones and otherwise more enriched populations generally require the transfer 
of fewer cells. The epitopes recognized can be housekeeping epitopes or a combination of 
housekeeping and unmune epitopes. It is also envisioned that genetic engineering can be used to 

25 ejqpress cloned TCRs in a cell line suitable for use in adoptive immunotherapy. Examples of 
sources from which useful TCRs can be cloned include the T cells described above, and HLA- 
transgenic mice unmunized wifli the vaccines of this mvention. Additional variations will be 
apparent to one of skill in the art 

In some embodiments of the invention, the vaccines can mclude a recombinant organism, 

30 such as a virus, bacterium or parasite, genetically engineered to express an epitope in a host. For 
example. Listeria monocytogenes, a gram-positive, facultative intracellular bacterium, is a potent 
vector for targeting TuAAs to the immune system. In a preferred embodiment, this vector can be 
engineered to e3q)ress a housekeeping epitope to mduce therapeutic responses. The normal route 
of mfection of thiis organism is through the gut and can be delivered orally. In another 
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embodiment, an adenovirus (Ad) vector encoding a housekeeping epitope for a TuAA can be used 
to induce anti-virus or anti-tumor responses. Bone marrow-derived dendritic cells can be 
transduced with Jhe. virus construct and then injected, or fihe virus can be delivered directly via 
subcutaneous injection into an animal to induce potent T-ceU responses. Another embodiment 
5 employs a recombinant vaccmia virus engineered to encode amino acid sequences corresponding 
to a housekeepmg epitope for a TAA. Vaccinia viruses carrying constructs with the appropriate 
nucleotide substitutions m the form of a minigene construct can direct the expression of a 
housekeepmg epitope, leadmg to a therapeutic T cell response agamst the epitope. 

Particular^ useful nucleic acid constructs useful as vaccmes m accordance with die 
1 0 present invention are disclosed herein. 
Epitope-Bn codihg Vector Constructs 

The presoit invention provides nucleic acid constructs for use as therapeutic vaccines. The 
constructs mclude a coding region having a sequence that encodes a polypeptide. The polypeptide 
is an epitope of a TAA. In one embodhnent, the target ceU is a neoplastic cell and the polypeptide 
15 is an epitope or precursor of an epitope of a TuAA. In another embodhnent, the target ceU is any 
ceU in&cted with an intracellular parasite. The term "parasite" as used herein includes any 
organism or infective agent siich as a y'mts that has an mtracellular stage of mfection within the 
host. These mclude but are not Umited to: viruses such as adenovuiis, cytomegalovirus, Epstein- 
BaiT virus, herpes sunplex virus 1, herpes simplex virus 2, human herpesvirus 6, yaricella-zoster 
20 vfrus, hepatitis B vmis, hepatitis D vfrus, p^illoma virus, parvovirus B19, polyomavirus BK, 
polyomavinis JC, hepatitis C vmis, measles vfrus, rubella vhus, human unmunodeficiency virus 
(mVX human Tcell leukemia virus I, and human Tcell leukemia virus U; bacteria such as 
CMamydick Listeria, Salmonella, Legionella, Brucella, Coxiella, Rickettsia. Afycobacterium; and 
protozoa such as Leishmania, Trypanosoma, Toxoplasma, and Plasmodium. 
25 The polypeptide(s) encoded by the nucleic acid construct can mclude a housekeepmg 

epitope of a TAA. In preferred embodiments, the nucleic acid construct encodes a plurality of 
housekeeping epitopes. When fte construct encodes such a plurality, the multiple epitopes can all 
correspond to different segments of a single TAA, or they can correspond to different TAAs. In a 
preferred embodhnent, the nucleic acid construct contains a housekeepmg epitope and an immune 
30 epitope. In another preferred embodiment, die nucleic acid construct contams a housekeepmg 
epitope and an epitope cluster region. 

In preferred embodiments, wherein the construct of the vaccine encodes both a 
housekeeping epitope and an immune epitope, the vaccme can stimulate a cellular immune 
response agamst target cells presenting either epitope-that is, the unmune response can recognize 
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the housekeeping epitopes displayed initially by the target cells, and tiien can also recognize the 
immnne epitopes presented by the target cells after induction by IFN. 

Advantageously, the nucleic acid construct can further include a third or fourth sequence, 
or more, with such sequences encodmg a thhrd or fourth epitope, or additional epitopes, 
5 respectively. Such epitopes can be derived from a single TAA or from two or more different 
TAAs, and can be housekeeping or immune epitopes on any combination. The constructs can be 
designed to encode epitopes correspondmg to any other proteasome activities that may play a role 
in processing antigens in any target cell or pAPC. 

The encoded MHC epitopes are preferably about 7-15 amino acids in length, and more 

10 preferably, 9 or 10 amino acids in length. While the generally preferred peptide size for MHC I 
binding is 9 amino acids, shorter and longer peptides may also in some cases bind MHC I. 
Likewise, many peptides much longer than 9 amino acids can be trimmed by exopeptidases or 
other proteases resident m the cell, to produce fragments that bind MHC I very effectively. The 
size of a peptide contaming an unmune epitope sequence is not critical, so long as the sequence 

15 mcludes Ae epitope. This is because the immune proteasome, resident in the pAPC, in 
combmation with trimming exopeptidases and other proteases, in its normal function correctly 
processes full length TAAs to produce immune epitopes. Thus, the nucleic acid sequence 
encoding the immune epitope can actually encode a much larger precursor, including the complete , 
TAA. Such a construct preferably also encodes a housekeeping epitope. 

20 Examples of TuAAs and other TAAs suitable for use in the present invention include but 

are not limited to; differentiation antigens such as MelanA (MART-I), gplOO (Pmel 17), 
tyrosinase, TRP-1, TRP-2, and tumor-specific multilineage antigens such as MAGE-1, MAGE-3, 
BAGE, GAGE-1, GAGE-2, CEA, RAGE, NY-ESO, SCP-1, Hom/Mel-40 and FRAME. Similarly, 
TuAAs include overexpressed oncogenes, and mutated tumor-suppressor genes such as p53, H-Ras 

25 and HER-2/neu. Additionally, unique TuAAs resulting from chromosomal translocations such as 
BCR-ABL, E2A.PRL, H4-RET, IGH-IGK, MYL-RAR and viral antigens such as Epstem Barr 
vims antigens EBNA, and the human papillomavirus (HPV) antigens E6 and E7 are included. 
Other useful protein antigens mclude but are not limited to TSP-180, MAGE-4, MAGE-5, MAGE- 
6, RAGE, NY-ESO, pl85erbB2, pl80erbB-3, c-met, nm-23Hl, PSA, TAG-72-4, CAM 17.1, 

30 NuMa, K-ras, ^-Catenin, CDK4, Mum-1, and pl6. These and other TuAAs and pathogen-related 
antigens are known and available to those of skill in the art in the literature or commercially. 

In a further embodiment, the TAA is an antigen specific for a virus. See Table 2 above. In 
yet another embodiment of the present invention, the TAA is an antigen specific for a non-viral 
mtracellular parasite. Examples of parasite-specific antigens include nucleotides, proteins, or other 
35 gene products associated with the intracellular parasite . Suitable nucleotides or proteins can be 



-48- 



wo 01^2963 PCT/USOl/13806 

found at fee NCBI Taxonomy Database located at 
http://wwwjicbialinjiih.gov/Taxonomy/taxittnl/. More detailed descriptions of gene products 
for parasites and other pathogens are provided at this web site. 

Particularly preferred peptides are about 7-15 amino acids in length. An extensive listing 
5 of peptides having MHC binding motife is provided in Han-Georg Rammensee, Jutta Bachmann, 
and Stefen Stevanovic, "MHC Ligands and Peptide Motifs," Springer-Verlag, Germany, (1997) 
Landes Bioscience, Austin, Texas. 

The epitopes encoded by the constructs have affinity to one or more MHC I alleles. In 
some embodiments, wherein a patient is heterozygous for MHC I, the construct can encode 
10 epitopes corresponding to differoit MHC I alleles. 

Preferred nucleic acid ctmstructs include at least one promoter sequence that is operably 
Unked to tfie 5' end of the coding region of tiie construct. It will be appreciated by those of skill in 
the art that any promoter active m mammalian cells can be employed. Preferred promoter 
sequences include, but are not limited to, the CMV promoter, the SV40 promoter, and retroviral 
15 LTR promoter sequences, and can also include EF-IA, UbC, 6-actin promoters. In some 
embodunents, tiie constructs can include two or more promoters that are operably linked to the 5' 
end of different polypeptide-encoding sequences. Likewise, the constructs can employ enhancers, 
nuclear unport sequences, immunostimulatory sequences, and expression cassettes for cytokmes, 
selection markers, reporter molecules, and the like. Moreover, immunostimulatory, or other , 
20 modulatory sequences can be attached to the vector via a stably hybridized PNA peptide nucleic 
acid. In preferred embodunents, the nucleic acid constructs of the present invention also include a 
poly-A sequence that is operably linked to a 3' end of the coding region. A nucleic acid construct 
tiiat includes a nuclear unport sequence and an unmunostimulatory sequence is depicted in 
Figure 9. 

25 In certam embodunents, the nucleic acid constructs encode an mRNA that is translated as a 

smgle polypeptide and then cleaved. In one such embodiment the polypeptide consists of a linear 
airay of epitopes, wherem the fu-st (N-tenninal)^sequence is one or more immune epitopes or 
epitope clusters, and the second (C-terminal) sequence is a housekeeping epitope, such that the 
correct C-termmus of the housekeepmg epitope is specified by the termination codon, and all other 

30 HLA epitope termmi are determined by proteasomal processing and exopeptidase trunmmg. 

In another preferred embodunent, the nucleic acid construct encodes an amino acid 
sequence wherein an immune epitope or an epitope cluster is linked to a ubiquitin sequence. The 
ubiquitm sequence is sunilarly linked to a housekeepmg epitope. The presence of ubiquitin 
between the epitopes facilitates efiScient delivery of the unmune epitope to the proteasome for 

35 epitope processing. The ubiquitm sequence (with or without an N-termmal spacer to ensure the 
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integrity of the preceding peptide) is located in frame between the first and second sequence, or 
between any olher epitope-encoding sequences. The so produced Sequencel-Ubiquitin-Sequence2 
polypeptide is rapidly (co-translationally) cleaved at the Ubiquitin-Sequence2 junction by 
Ubiquitin-specific processing proteases, producing Sequencel-Ubiquitin and Sequence2. (See 
5 Figure 10) 

Physiologically, ubiquitin serves primarily as a signal that targets protein for degradation 
by the proteasome. It is among the most conserved proteins in eukaryotes, with only three 
conservative amino acid substitutions between yeast and human. Although the precise sequence of 
ubiquitin may vary somewhat, the sequence of the preferred embodiment is represented by SEQ ID 

10 NO: 5. Ubiquitm is a 76 amino acid long polypeptide having two crucial features: 1) a C-terminal 
Gly residue, involved in the conjugation of ubiquitin to the Lys side chain of protein substrates and 
2) a Lys residue, at position 48, for the formation of multi-ubiquitin chains. 

Ubiquitin genes are muque in the sense that all of them are syndiesized as fusions to other 
polypeptides, including other ubiquitms. In the yeast S. cerevisiae, four ubiquitin genes have been 

15 identified: \diereas the first three (UBIl-3) are fused to ribosomal proteins, the fourth gene (UBI4) 
is syntibiesized as a fusion of five identical repeats of the ubiquitin sequence. Thus, functional free 
ubiquitin is naturally produced after co-translational proteolytic processing by ubiquitously 
e^ressed ubiquitin-specific hydrolases. Such a natural organization has been exploited by 
generating C-terminal fusions between a smgle ubiquitin moiety and any desired polypeptide. 

20 Ubiquitin can exist m two conformations: the first one is described above and consists of a 

linear fusion of a single ubiquitin to any desired polypeptide, in which the C-termmal Gly of 
ubiquitin is linked, via a peptide bond to the N-terminal amino acid of tiie polypeptide of choice. 
The second involves flie conjugation of a ubiquitin moiety to a protein substrate, via a Gly-Lys 
bond formation. In this case, the COOH group of flie ubiqutin Gly is linked to the e (epsilon) side 

25 chain of a solvent exposed Lys of tiie substrate (or another ubiquitin moiety). The ubiquitin signal 
for the degradation of the substrate is associated with the second conformation. Thus, in the 
Sequencel-Ubiquitin-Sequence2 construct described above, Sequence2 typically is not targeted to 
the proteasome. Accordingly, the Sequence2 position is preferably used for a fully processed 
epitope, or one needmg only N-terminal trimming, typically a housekeeping epitope. The ubiquitin 

30 moiety remaining attached to Sequence 1 in the construct described above can be polyubiquitinated 
at Lys48, thereby targeting that fragment to the proteasome for processing, and resulting in the 
liberation of the epitope contained in Sequence 1. It should be noted that if more than two 
sequences are linked together in a linear array by ubiquitin moieties, generally only flie last 
sequence behaves in the manner of Sequence2; the processing of all of the upstream sequences 

35 resembles that of Sequencel. To the extent that the constructs described herein are expressed in 
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pAPCs, wherein the immune proteasome is predominantly active, correct expression of 
housekeepmg epitopes by these constructs benefits fcom the housekeeping epitopes being in the 
Sequence2 position, or a correspond position wherein the epitope does not require proteasomal 
processing in the pAPC. 

5 In yet another embodiment the nucleic acid constructs of the present invention may include 

autoproteolytic peptide-encoding sequences. Such sequences are located between the &st and 
second sequences or between any oAer epitope-encodmg sequences. Examples of such 
autoproteolytic sequences include the inteins; also included are the 30^'*' and 2A^ proteases of 
picomaviruses, including polioviruses and other enterovmises, rhinoviruses, cardioviruses, and 
10 apthoviruses, and the equivalent comoviridae proteases. These proteases catalyze the post- 
translational cleavage of the large precursor polyprotein made by this family of viruses. 

In one embodiment, the autocatalytic protem sequence is inserted between two or more 
epitopes. In a further embodiment, the sequence is inserted after two or more epitopes, but the 
cleavage signal is found between the epitopes such that they are cleaved into two or more fully 

15 functional epitopes. The type of protease is not important, it is only unportant that the appropriate 
cleavage signal be available for the correct processing of the epitopes. 

Because the cleavage sites and the sequences of the autocatalytic proteins are known 
(recently reviewed by Seipelt, J. et al., Virus Research 62:159-168, 1999) they can easily be used 
for construction of a vector which produces a polyprotein or biprotem. Briefly, 3C^ 

20 predominantly recognizes a Q-G site as a cleavage signal although other closely adjacent positions 
can be important. Also the 30^"^ of some of these viruses adhere less closely to this general 
pattern, providing for a greater degree of flexibility in design. The limitation imposed by these 
requirements is more formal than real, particularly if the protease is placed between die epitopes to 
be ejqpressed. In this arrangement an upstream unmune epitope can be liberated by proteasomal 

25 processing even if the viral protease fails to cleave its N-terminus. The key residues for cleavage 
at the C-terminus are mtemal to 30^"' itself, generally leaving just 1-4 residues, if any, to be 
removed by exopeptidase trimming from the N-terminus of a downstream housekeeping epitope. 
2A^^ can be used much the same way with the understanding that the cleavage site, while favormg 
G-P, is somewhat more variable among these viruses. It must also be considered that its e7q)ression 

30 can lead to a shutdown of host cell protein synthesis with a rapidity and completeness that depend 
on the virus strain from which it was derived. 

Strictly speaking, the 2A proteins from cardioviruses and apthoviruses (i.e., Foot-and- 
MouHi Disease Vhiis (FMDV)) are not proteases, but rather prevent peptide bond fomation at 
dieur C-termmi without causing a tennination of translation (Ryan, MX)., et al., Bioorganic 



-51- 



wo 01/82963 PCTAJSOl/13806 

Chemistry 27:55-79, 1999).Thus by positioning these 2A proteins between epitopes one can cause 
scission within a single reading ftame. The 2A protein from FMDV is very small, only 18 amino 
acids, making it particular^ well suited to multiple epitope expression. A plasmid employing the 
2A protein is depicted as Figure 1 1 . 

In certam other embodimentst, the nucleic acid constructs encode an mRNA that is 
translated as two or more polypeptides. In one such embodmient Ae transcript can contain one or 
more internal ribosome entry site (IRES) sequences that are located between the first and second 
sequence or between any other epitope-encoding sequences. IRES sequences are naturally used by 
picomaviruses to direct internal cap-independent translation of mRNA. Such IRES sequences can 
also allow independent translation of two or more consecutive open reading frames from the same 
messenger RNA. Although the IRES sequences of various constructs may vary, the IRES sequence 
of one preferred embodiment is. provided in SEQ ID NO: 6. ITie C-terminus of each epitope 
expressed is determined by termination codons. Thus Ihe oider of the sequences encoding the 
housekeeping epitope and the sequences encoding the immune epitope does not matter, which 
provides flexibility of plasmid construction. Optionally, the sequence encoding the housekeeping 
epitope can precede the IRES sequence and the sequence encoding the immune epitope can be 
linked to the other end of the IRES sequence. Such vectors can also usefully encode two or more 
housekeeping epitopes. They can further allow the combination of the various single polypeptide 
constructs described above, in order to productively express multiple epitopes. See Figure 12. 

In certam other embodiment^ the nucleic acid constructs encode two or more mRNA 
transcripts. Each of these transcripts may encode single epitopes or any of the dual or multiple 
epitope transcripts described m the embodunents above. Two or more transcripts can be the result 
of usmg multiple promoters. Those of skill m the art will recognize that use of more than one copy 
of a single promoter can lead to instability of the plasmid during propagation. Thus it will 
generally be preferable to use two (or more) different promoters. 

Two or more transcripts can also be the result of using bidirectional promoters. 
Bidirectional promoters can be found in a wide variety of organisms. Examples of such promoters 
include PDGF-A from human, pcbAB and pcbC from Penicilliwn chrysogenum, neurotropic JC 
virus, and. BRCAl from mouse, dog and human. Although intensive research on bidirectional 
promoters began comparatively recently, there is a growing body of information on the sequence, 
regulation, and other intricacies of how they work. For example, the human transcobalamin n 
promoter requu^s a 69 base pair (bp) fragment containing a GC box and an E box for full 
transcriptional activity. The dipeptidylpeptidase TV promoter was shown to stimulate transcription 
from both sides with a similar efficiency. Rat mitochondrial chaperonins 60 and 10 are Imked 
head to head and share a bidfrectional promoter. Accordingly, various working bidu^ctional 
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promoters have been identified, sequenced, and cloned in such a way that they can be used in a 
nucleic acid construct to express two genes. 

Thus, in a preferred embodiment, the nucleic acid constructs contain bidirectional 
promoters such as, for example, those listed above, linked to a nucleic acid sequence encoding a 
5 housekeeping epitope or precursor thereof In a particularly preferred embodiment; the nucleic 
acid construct contams bidirectional promoters linked to nucleic acid sequences encoding a 
plurality of housekeeping epitopes. In another embodiment, the nucleic acid constructs comprise 
bidirectional promoters linked to nucleic acid sequences encoding a housekeeping epitope and an 
immune epitope, or to an epitope cluster region. In addition, the bidirectional promoter may be 

1 0 positively or negatively regulated. 

When the nucleic acid construct contains more than one epitope, the bidirectional promoter 
may express the plurality of epitopes in comparable amounts or some may be expressed at higher 
levels than the others. Alternatively, one epitope can be inducible and the other constitutive. In 
this way, a temporal regulation of epitope expression can be achieved, wherein one epitope is 

1 5 expressed early in the treatment and the other expressed later. 
Analvsis of Epitope Expression 

Several methods described in the literature can be used to determine if epitopes have been 
presented on pAPCs. An indu-ect but powerful method is the use of class I tetramer analysis to. 
determine T cell frequency in an animal before and after administration of a housekeeping epitope. 

20 Clonal expansion of T cells in response to an epitope occurs preferentially only when the epitope is 
presented to T cells by pAPCs. Therefore, measurements of specific T cell frequency against the 
housekeeping epitope before and after administration of the epitope to an animal is a means of 
determining if the epitope is present on pAPCs. An increase in frequency of T cells specific to the 
epitope after administration indicates that the epitope was presented on pAPC. Other methods of 

25 determining T cell frequency such as limiting dilution analysis or ELISPOT can be used in 
principally the same manner to assess housekeeping epitope presentation by pAPCs. Similarly any 
method of determining T cell frequency m an animal may be used. 

A direct method for determining housekeeping epitope presentation on pAPCs involves the 
purification of pAPCs from an animal after administration of an epitope. After vaccination of an 
30 animal with an housekeeping epitope, pAPCs may be harvested from PBMC, splenocytes or lymph 
node cells, using monoclonal antibodies against specific markers present on pAPCs and affinity 
purification, such as with the use of monoclonal antibodies fixed to magnetic beads. The optimal 
tune for such harvest is variable, and can depend on the ammal vaccinated, the nature of the 
vaccme, and other factors including dosing, site of administration, pharmacokinetics, and the like. 
35 Crude blood or splenoctye preparation can be enriched for pAPCs using this technique. The 
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enriched pAPCs can then be used in a proliferation assay against a T cell clone that has been 
generated and is specific for the housekeeping epitope of interest. The pAPCs are comcubated 
with the T cell clone and the T cells are monitored for proliferation activity, such as by measuring 
the incorporation of radiolabeled thymidine by T cells. Proliferation indicates that T cells specific 
5 for the housekeeping epitope are bemg stimulated by Aat epitope on the pAPCs. 

The following examples are intended for illustration purposes only, and should not be 
construed as limiting the scope of the invention in any way. 
Examples 

Example 1. Proteolytic c haracterization of an HLA epitope as a housekeeping epitope or an 
10 hnmune epitope 

Using the procedures described below, a synthetic peptide of 13 amino acids or more is 
prepared, contaming tihe candidate HLA epitope centrally. Proteasomes are prepared from cells 
etxpressing each type of proteasome, for example red blood cells and Raji cells for housekeepmg 
and hnmune proteasomes, respectively. The peptide is digested with the proteasome preparations 
15 and the resultant fragments identified by mass spectrometry. If one of those Augments is co-C- 
terminal with the HLA epitope, and is produced in significant yield m the preparation containing a 
housekeepmg proteasome, then the HLA epitope is a housekeeping epitope. Similarly, if one of 
those fi-agments is co-C-terminal with the HLA epitope and is produced m significant yield by the 
immune proteasome, and is not produced in significant yield by the housekeeping proteasome, then 
20 fee HLA epitope is a immune epitope. 

A. Peptide Synthesis 

Synthetic or recombinant polypeptides are constructed which encompass the HLA epitope 
and at least two residues proximal to its termini. These residues added to the ends of a particular 
HLA epitope are to ensure that the proteasome complex encounters a processing environment 

25 shnilar to tiiat found within the cell, hence increasing the likelihood that it performs its proteolytic 
functions normally. Additional residues normally found proximal to the ends of the HLA epitope 
can be added if necessary to help increase the solubility of the peptides. 

Some HLA epitopes present solubility difficulties due to their high hydrophobicity. 
Certain peptides can be extremely difficuh to purify because they will not dissolve in normal 

30 chromatographic eluents, or they can be very difficult to use once purified because they will not 
dissolve m the digestion buffers. This problem can be avoided by carefully choosing which part of 
the sequence surrounding the HLA epitope to include in a particular peptide construct, or by 
extending the sequence as mentioned in the preceding paragraph. If there are no residues proximal 
to the ends of the HLA epitope that can help increase the solubility, a short hydrophilic sequence 
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can be added instead (e.g. -EAEAE). This is added at least three to five residues past the end of 
the HLA epitope to maintain a natural terminal cleavage site for the proteasome. 

In a preferred embodiment peptides are synthesized on an Applied Biosystems 433A 
Peptide Synthesizer using standard Fmoc solid phase synthesis methodologies. The synthesizer is 
5 equipped with a conductivity feedback monitoring system which allows for increased reaction 
times for sequences that contam stretches of residues that are difficult to deprotect and/or difficult 
to couple. After synthesis, the peptides are cleaved from their support with trifluoroacetic acid in 
the presence of appropriate scavengers, precipitated with ether, and then lyophilized. 

The crude peptides are tiien purified on a preparative diphenyl HPLC column after &st 
10 developing a gradient using a similar analytical diphenyl HPLC system. The major HPLC 
ftactions from the first preparative injection of the peptide aro analyzed by electrospray mass 
spectrometry to identify, the target compound. Hie corresponding peaks from subsequent 
injections are collected, pooled and lyophilized, and a sample is taken to verify retention time and 
chromatographic purity by analytical HPLC. These purified peptides are then ready for digestion 
15 by the proteasome preparation. 

B. Proteasome Assay 

Immune or housekeeping proteasome complexes are isolated as is described in detail in 
Example 2 below. 

Hie purified peptides are then dissolved in an appropriate buffer to a concentration of, 
20 about 1 mM and added to approximately 2 volumes of the proteasome preparations. Replicate 
digests are prepared: one for mass spectrometry analysis and one for HPLC analysis, and an 
additional digest is prepared using a positive control peptide to verify proper fimctioning of the 
proteasome preparation used. The following peptides are suitable for use as control peptides for 
immune proteasome assays: MLLAVLYCLLWSFQTS (SEQ ID NO: 7); 
25 HSYTTAEEAAGITILT\aLGVL (SEQ ID NO: 8); EAASSSSTLVEVTLGEVPAAESPD (SEQ ID 
NO: 9); EFLWGPRALVETSYVKVLHHMVKI (SEQ ID NO: 10); APEEKIWEELSVLEVFEGR 
(SEQ ID NO: 11); and ELMEVDPIGHLYIFAT (SEQ ID NO: 12). Underiined residues indicate 
proteolytic cleavage sites. Peptide FLWGPRALVETSYVK (SEQ ID NO: 13) is suitable as a 
control peptide for housekeeping proteasome assays. These are allowed to incubate in parallel at 
30 37^C for a period of time and then the digestion is stopped by the addition of dilute trifluoroacetic 
acid and the samples frozen on dry ice. One replicate and a positive control are sent for analysis 
using a Lasermat 2000 (Finnigan Mat, LTD, U.K.). Matrix Assisted Laser Desorption Ionization - 
Time Of Flight (MALDI-TOF) mass spectrometry, and the others are set aside for HPLC. 
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C. MALDI-TOF Mass Spectrometric Analysis of the Digest 

Analysis of the digests is conducted employing either "Peptide" software, (Lighthouse 
Data), or software available fipom TTiermoBioanalysis Ltd., U.K This software can generate the 
sequence and molecular weight of all the possible fragments that satisfy both requirements of 
5 having the correct C-terminus of any predicted epitope, and containing the full length of that 
epitope or longer. 

For example, if the HLA epitope encompassing peptide is of the sequence: 

AAML LAVIiYCIiLSEIA AAEEE, 

10 

where the underlmed sequence is the HLA epitope, then the program would identify all of the 
following sequences as being potentially useful, and would assign each a molecular weight 

AAML LAVLYCLLSBI 
15 AML LAVLYCIiLSBI 
ML LAVLYCIiLSEI 
L LAVLYCLLSEI 
I iAVLYGLLSE I 
AVLYCLIiSEI 

20 

If the MALDI-TOF results show that one or more of those molecular weights is 
represented in a digestion mixture, then the corresponding peptide is synthesized, purified, 
identified by mass spectrometry and then subjected to analytical HPLC to establish both a standard 
retention time and an approximate mass to peak area ratio. The reserve digest is then diluted in an 

25 appropriate solvent and injected using the same analytical HPLC method. If the digest gives a 
peak in good yield that has the same retention time as that of the standard, it is almost certain that 
it is due to the presence of that sequence in the digest. If there is any ambiguity due to the possible 
generation of other fragments that would give the same or similar mass spectrometry results, the 
suspect component can be collected and set aside for C-terminal sequencing to confirm identity. 

30 Example 2. Purification of proteasome complexes 
A. Proteasome complexes from blood cells 

Concentrated erythrocyte bags were obtained from a local blood bank, (HemaCare, Van 
Nuys, CA). The contents of each bag were poured mto 200 ml centrifuge tubes and washed 3 
times with PBS by centrifugation at 2000 RPM for 10 minutes at room temperature in a swinging 
35 bucket rotor of a Megafuge 2.0 (Heraeus, Southplainfield, NJ). After the last wash the samples 
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were pooled in one contains, to minimize variability among tubes, and tiien re-divided into several 
centrifuge tubes. The cells were centrifiiged again at 2000 RPM for 10 min. The residual PBS was 
aspirated. The pellet was stored at -70** C until use. 

B. Proteasome complexes from tumor cells 

5 Raji ceUs, a Buridtt's lymphoma cell line, were obtained from A7XJC, (American Type 

Culture Collection. Manassas, VA). The cells were grown using standard cell culture methods and 
stimulated with INF-Gamma (100-500U/ml) (Pharmingen, San Diego, CA). Expression of 
unmune proteasome subunits was confirmed separately by immunohistochemsitry on the culture, 
and SDS-PAGE on a sample of the ceU lysate. The cells wer« collected by centrifugation, washed 
10 with PBS and stored at -70* C untfl use. 

C. Further pro cessing of proteasome complexes 

Blood or lymphoma tumor cell pellets (frozen) were thawed in a 37°C bafli and ddHaO was 
added to each tube. The cell suspension was homogenized in a 40 ml Dounce homogenizer. 
Furflier, for the tumor cells, the cell homogenate was centrifiiged at 2000 rpm to remove cell 
15 debris. Ite supernatant was centrifiiged at 10,000 rpm at 4° C for 10 minutes and further 
cenlrifuged at 50,000 rpm at 4° C for 30 mmutes in a T-1270 rotor (Sorval, Newtown, CT). 

The homogenates were passed through filter paper to remove debris, and then pooled 
together. A 68% sucrose sohition was added to the pooled homogenate sample. An antibody- 
Sepharose preparation 'vras incubated with the homogenate for three hours at room temperature m a 
20 rotator. The suspension was centrifoged and washed 3X with TBS and fiirther thoroughly washed 
over vacuum funnel 6-8 X Proteasomes were eluted in TBS (pH 7.6) and the optical density of the 
eluate was measured. The proteasome preparation was dialyzed overnight at 4» C against 20mM 
Tris (pa 7.6) using cellulose membrane MWCO 1000. The next day the proteasome preparation 
was concentrated by ultrafiltration in a Millipore ULTRAFREE-15 centrifiigation device 
25 (Millipore, Danbury, CT). The proteasomes. at a concentration of 4mg/ml, were then aliquotted 
and stored at -20'» C until use. The proteasomes were tested for activity and specificity by 
digestion of a fluorogenic substrate or a control peptide yielding known fragments. The following 
peptides are suitable for use as control peptides for immune proteasome assays: 
MLLAVLYCLLWSFQTS (SEQ ID NO: 14); HSYTTAEEAAGITlLrVILGVL (SEQ ID NO: 15); 
30 EAASSSSTLVEVTLGEYPAAESPD (SEQ ID NO: 16); EFLWGPRALVETSYVKVLHHMVKI 
(SEQ ID NO: 17); APEEKIWEELSVLEVFEGR (SEQ ID NO: 18); and ELMEVDPIGHLYIFAT 
(SEQ ID NO: 19). Underlined residues indicate proteolytic cleavage sites. Peptide 
FLWGPRALVETSYVK (SEQ ID NO: 20) is suitable as a control peptide for housekeeping 
proteasome assays. 



35 
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D. Ouantitation and activity analysis of proteasome preparations 

An enzyme-linked immimosorbant assay (ELISA) was xised to quantitate the proteasome 
preparations described above. ELISA techniques are well known in the art and are discussed 
generally in Ausubel, et al., "Short Protocols in Molecular Biology," 3"* Ed., Unit 11.2 (1997). 
5 Hybridoma cells (MCP-21) producing a monoclonal anti-human proteasome antibody were 
obtained fix>m the European Collection of Cell Culture ((ECACC;, UK) and were maintained using 
standard cell culture techniques and equipment. Hybridoma supplement (Gibco BRL, Rockville, 
MD) was added to the antibody-producing cells. Upon reaching cell density of 500,000 cells/ml in 
an approximate volume of 2-3 liters, the cells were removed by centrifiigation and the supernatant 

10 was collected. Secretion of mAb m the medium was monitored periodically by optical density 
(OD.) using a Lambda 20 Spetrophotometer (Perikin Elmer, Norwalk, CT). 

The supernatant was passed over a protein G sepharose column (Amersham/Pharmacia 
Biotech Piscataway, NJ). The cohmin was washed with PBS and the antibody was eluted in a 
O.IM glycine buffer, pH 2.2. The optical density of the eluate fractions was measured at 280 nm, 

15 and the positive factions were collected. The antibody was dialyzed against 2L of PBS for. 2 days 
at 4^ C and stored until use. 

The antibody was bound to CNBr-activated Sepharose 4B (Amersham Pharmacia biotech, 
Piscataway, NY). The antibody-Sepharose complex was washed alternately 5-7 times with O.IM 
sodium acetate saline, pH 4 and O.IM sodium borate saline, pH 8 and finally suspended in Tris 

20 buffered saline (TBS), pH 8. The preparation was stored at 4° C until use. 

Example 3. Generation of predicted MHC I peptide cleft binding peptides usinp alg orithmic 
modeling 

A population of candidate MHC I binding peptides, generated from the amino acid 
sequence of human carcinoembryonic antigen precursor (CEA) (GENBANK ACCESSION 

25 P06731), was produced using an algorithm. The particular algorithm is available at 
«http://134.2.96.221/scripts/hlaserver.dll/EpPredicthtm», as discussed above. Once the 
algorithm was accessed, the ammo acid sequence for CEA was provided. Next, parameters for the 
length of the epitope (decamers) and the particular MHC allele (H2-Db) of interest were selected. 
Following this, die data were submitted for algorithmic analysis. The resulting data are shown in 

30 Table?. 
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TABLE? 

Fragments of CEA having Predicted AfBnity for H2-Db 



POS 


1234567890 


SCO 
re 


Seq 

Id 

no 




POS 


1234567890 


See 
re 


Seq 

Id 

no 


D4t / 


Liyij SNGNRTIi 


26 


21 




496 


VSAELPKPSI 


16 


51 


ICQ 


LQIjSNDNRTL 


26 


22 




465 


SNITEKNSGL 


16 


52 




JjQIiSNQNRTL 


26 


23 


453 


GNIQQHTQEL 


16 


53 




LLVHNLPQHIi 


26 


24 


441 


SNPPAQYSWL 


16 


54 


•St J. 


IiSwDNRTliTL 


25 


25 


393 


CGIQNELSVD 


16 


55 


c A a 

549 


LSNGNRTLTL 


24 


26 


242 


ISPLNTSYRS 


16 


56 


X93 


LSNGNRTLTL 


24 


27 


91 


PGPAYSGRBI 


16 


57 


299 


CQAHNSDTQL 


23 


28 


43 


PNVAEGKEVL 


16 


58 


100 


IIYPNASLLI 


21 


29 


693 


IGVLVGVALI 


15 


59 


578 


SANRSDPVTIi 


19 


30 


684 


SAGATVGIMI 


15 


60 


576 


SVSANRSDPV 


19 


31 


510 


SKPVEDKDAV 


15 


61 


504 


SrSSNNSKPV 


18 


32 


482 


SASGHSRTTV 


15 


62 


356 


WWV'NNQSIiPV 


18 


33 


428 


RPGVNLSLSC 


15 


63 


178 


WWVNNQSLPV 


18 


34 




399 


LSVDHSDPVI 


15 


64 


148 


SISSNNSKPV 


18 


35 


372 


SNDNRTLTLL 


15 


65 


127 


SDIiVNBEATG 


18 


36 


332 


SNPVEDBDAV 


15 


66 


645 


ITPNNNGTYA 


17 


37 


329 


SNNSNPVEDB 


15 


67 


540 


SLPVSPRLQL 


17 


38 


307 


GLNRTTVTTI 


15 


68 


362 


SLPVSPRLQL 


17 


39 


289 


ITVNNSGSYT 


15 


69 


326 


FITSNNSNPV 


17 


40 


280 


STQELFIPKTI 


15 


70 


250 


RSGENLNIiSC 


17 


41 


277 


FQQSTQELPI 


15 


71 


184 


SLPVSPRLQL 


17 


42 


222 


SARRSDSVIL 


15 


72 


140 


VYPBLPKPSI 


17 


43 


221 


VSARRSDSVI 


15 


73 


40 


STPFNVAEGK 


17 


44 


154 


SKPVEDKDAV 


15 


74 


29 


NPPTTAKLTI 


17 


45 




135 


TGQPRVYPBL 


15 


75 


555 


CFVSN-LATGR 


16 


46 




49 


KEVLLLVHNL 


15 


76 


608 


GANLNLSCHS 


16 


47 




34 


AKLTIBSTPF 


15 


77 


606 


LSGAWLNLSC 


16 


48 








604 


SYLSGANLNL 


16 


49 




571 


CGIQNSVSAN 


16 


50 



The table above arbitrarily cuts oflF scores below 15. The algorithm can produce scores of 
less than IS. 

Example 4. Digestion of peptide precursors using immune and housekeeping proteasomes to 
determine fragmetifa produced by proteolvtic digestion 

Peptides were synthesized using a 433 A ABI synthesizer. Peptides were produced in 0.25 
mmole quantities using Fastmoc chemistry. The peptides were tested for solubility and once 
solubilized, a 2mM solution was prepared and divided into - 25-30jiL aliquots which were stored 
at -20°C for future use. Timed digest reactions, typically consistuig of 2jil of peptide and 4nl of 
proteasome, were conducted with t=0 as a control and an incubation of the peptide with water 
instead of the proteasome as a further control. The reaction was carried out at 37* C and ended by 



-59- 



wo 01/82963 PCT/USOl/13806 

the addition of 10% TFA (trifluroacetic acid) on dry ice. The frozen samples were then analyzed 
by MALDI-TOF mass spectroscopy (MS) as described in Example 5, below. 

An optional desalting step can be performed on the digests prior to MS analysis using the 
ZIP-HP mefliod (MUlipore, Boston, MA). The ZIP TIP is a specially designed pipet tip which 
5 contams a bed of spherical silica resin. The sample is boraid to the tip, which is pre-equilibrated 
with 0.1% TFA, and then eluted with 50% Acetonitrile 0.1% TFA elution buffer. 
Example 5. Identification a nd quantitation of relevant proteolytic fragments by HPLC and mass 
spectrometry 

A. Identifying sequences of therapeutic interest 
10 The amino acid sequence of a protem of mterest is entered mto a computer, and the 

algorithm of Rammensee, et al., is used to generate 9- or 10-amino-acid-long sequences predicted 
to bmd a particular HLA receptor. The algorithm also ranks these predicted epitopes according to 
how well they match the bmdmg motif. 

Synthetic peptides contaming the sequence of the identified potential epitopes are then 
15 constructed to encompass the epitope candidate sequence and at least 3-5 residues proximal to its 
termmi. The residues added to the ends of a particular epitope candidate are to ensure that the 
proteasome complex encounters a processmg envhonment sunilar to that found within the cell, 
hence mcreasing the likelihood that it performs its proteolytic fimctions normally. Additional 
residues normally found proximal to the ends of the epitope candidate may be added if necessary to 
20 help mcrease the solubility of the peptides. 

Peptides are synthesized on an Applied Biosystems 433A Peptide Synthesizer (Applied 
Biosystems, Norwalk, CT) using standard Fmoc solid phase synthesis methodologies. The 
syntiiesizer is equipped with a conductivity feedback monitormg system which allows for 
mcreased reaction times for sequences that contam stretches of difficult to deprotect and difficult 
25 to couple residues. After synthesis, the peptides are cleaved from their support with trifluoroacetic 
acid in the presence of appropriate scavengers, precipitated with ether and then lyophilized. 

Ihe crude peptides are then dissolved m a suitable solvent at 0.5 mg/mL Five microliters 
(5pl) of this solution is then analyzed on a Shimadzu analytical reverse phase HPLC system 
(Shunadzu Scientific Instruments, Cohmbia, MD) using a 0.1% TFA water - acetonitrile gradient 
30 Typically, a C-IS silica column (Machery-Nagel # 720051.40, (Machery-Nagel GmbH, Germany)) 
is used for hydrophillic and a phenyl silica column (Vydac # 219TP5415 (The Separations Group, 
Inc., Hesperia, CA)) is used for hydrophobic peptides. The gradients used vary from 0-40% 
acetonitrile for hydrophillic to 30-70% acetonitrile for hydrophobic peptides. The peptides are 
subsequently purified on a Varian Prostar HPLC system (Varian, Lie, Palo Alto, CA) using sfanUar 
35 gradients and semi-^preparative versions of the above-mentioned columns (Machery Nagel # 
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715802, and Vydac 219TP510). The major HPLC fractions from the first preparative injection of 
the peptide are analyzed using a MALDI-TOF mass spectrometer to identify the desired 
component. The corresponding peaks from subsequent injections are collected, pooled and 
lyophiiized, and a sample is taken to verify retention time and chromatographic purity by analytical 
5 HPLC usmg the system described above. These purified peptides are then ready for digestion by 
the proteasome preparation. 

B. Proteasome assay 

Immune or housekeeping proteasome complexes are isolated by the method of Levy, 
(Morel, S., et al.. Immunity 12:107-117 (2000), and the references cited therein) described above. 

10 The purified peptide is dissolved in an appropriate buffer to a concentration of about 1 to 2 mM 
and added to approximately 2 volumes of the proteasome preparation. The buffer chosen must 
solvate the. peptide without interfering with the digestion process. An additional digest is prepared 
using the positive control peptide described above to verify proper functioning of the proteasome 
preparation used. These are incubated at 3TC for periods of up to 120 minutes and then the 

15 digestion is stopped by the addition of dilute trifluoroacetic acid; the samples are analyzed 
immediately by mass spectrometry, or they are frozen on dry ice until analysis. The digest reaction 
can also be halted by putting samples on ice for immediate analysis by mass spectrometry. 

C, MALDI-TQF mass spectrometric analysis of the digest 

Approximately 0.5 jil of each digest was mixed witti an equal volume of the matrix 
20 solution (10 mg/ml dihydroxybenzoic acid in 70% EtOH, pH 2-3) dbectly on the sample slide and 
allowed to aur dry at about 40**C. The samples were then analyzed on a Lasermat™ MALDI-TOF 
mass spectrometer (Thermo Bioanalysis, Santa Fe, NM)that was calibrated with suitable molecular 
weight standards. 

The computer programs (ei&er "Peptide" software, (Lighthouse Data), or "Dynamo" 
25 (ThermoBioanalysis Ltd., U.K)) developed for the proteasome assay generates the sequence and 
molecular weight of all the possible fragments that satisfy both requirements of having the correct 
C-terminus of any predicted epitope, and of containing the full length of that epitope or longer. 

When the MALDI-TOF results showed that a particular molecular weight was represented 
m a digestion mixture, the corresponding peptide was synthesized, purified, identified by MALDI- 
30 TOP and then subjected to reverse phase analytical HPLC to establish a standard retention time 
and an approximate mass to peak area ratio. These procedures are directly analogous to those 
described above. A replicate proteasome digest was then diluted in an appropriate solvent and 
analyzed usmg the same analytical HPLC method. When the digest gives a peak in good yield that 
has the same retention time as that of the standard, it is ahnost certain that it is due to the presence 
35 of that sequence in the digest When there is any ambiguity due to the possible generation of other 
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fragments that would give rise to the same or similar mass spectrometry results, the suspect 
component can be collected and set aside for sequencing to confinn identity. The analytical HPLC 
also importantly provides relatively accurate quantitation of the peptide product in the digest, 
which allows determination of whether a given peptide is a minor or a major product of the digest, 
5 which indicates whether the epitope is efficiently produced by the proteasome. Using the above 
meAod, housekeeping epitopes were identified. Figure 13 shows the results of a flow cytometry 
assay to verify HLA binding by these epitopes. This assay is discussed in Example 6. 
Example 6. Deter mine Ifae MHC binding ability of selected peptides 

Binding of a candidate epitope to HLA-A2.1 was assayed according to the method of 

10 Stauss et al., (Proc Natl Acad Sci USA 89(17):7871.5 (1992)). T2 cells, which express empty or 
unstable MHC molecules on their surface, were washed twice and suspended at 5x10^ cells/ml in 
serum-free complete Iscove^s modified Dulbecco's medium (IMDM). microglobulin (Sigma, 
St. Louis, MO) was added at S^ig/ml and the cells distributed to a 96-well U-bottom plate at 5x10^' 
cells/weU. Peptides were added at 100, 10, 1 and O.lng/ml. The plate was rocked gently for 2 

15 minutes and then incubated for 4 hours in a 5% COj incubator at 37^C. After the unbound peptide 
was removed by washing twice with IMDM, a saturating amount of monoclonal antibody W6/32 
(Sigma) was added. After incubation for 30 minutes at 4°C, cells were washed with PBS 
supplemented with 1% heat-inactivated PCS, 0,l%(w:v) sodium azide, pH 7.4-7.6 (staining 
buffer), and incubated with fluorescein isothiocyanate (FITC)«conjugated goat F(ab') antimouse- 

20 IgG (Sigma) for 30 min at 4°C and washed four times as before. The cells were resuspended in 
stainmg buffer and fixed by adding a quarter volume of 2% paraformaldehyde. The analysis of 
surface HLA-A2.1 molecules stabilized by peptide bmding was performed by flow cytometry 
using a FACScan (Becton Dickinson, San Jose, CA). 

Ihe results of the experiment are shown in Figure 14. Using the method discussed above, 

25 a candidate tyrosinase housekeepiag epitope identified by proteasomal digestion, (tyrosinase 207- 
216, FLPWHRLFLL SEQ ID NO: 78) was found to bind HLA-A2.1 to a similar extent as the 
known A2.1 binder FLPSDYFPSV (SEQ ID NO: 79) (positive control). HLA.B44 binding peptide 
AEMGKYSFY (SEQ ID NO: 80) used as a negative control. The fluorescence obtained fi-om the 
negative control was similar to the signal obtained when no peptide was used in the assay. Positive 

30 and negative control peptides were chosen fi-om Table 18.3.1 in Current Protocols in Iinmunology 
p. 1 8.3 .2, John WUey and Sons, New York, 1 998. 

Example 7. Elution of HLA epitopes fi-om tumors, tissu e samples, immortalized cell lines, or 
tumor cell lines 

Rather than generating HLA epitopes with in vitro proteolysis, they can be identified after 
35 elution from the HLA of tumors, tissue samples, tumor cell Imes or other immortalized cell lines 
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using mass spectrometry meAods. While a variety of such methods can be used, one of the most 
powerful methods of identifying epitopes from the surface of cells involves capillary or 
nanocapillary HPLC ESI mass spectrometry and on-line sequencmg, as described in the published 
literature. Elution procedures for solubilized HLA and intact cells are also described in Falk, K. et 
5 al. Nature 351:290, 1991 and in U.S. Patent 5,989,565, respectively. Not described in the 
literature, however, is the need to identify the type proteasome expressed in the cells undergoing 
peptide elution and analysis, so as to determine if the epitopes identified are housekeepmg 
epitopes, which are needed to make effective vaccines. To definitively identify the HLA epitope 
as either a housekeeping or unmune epitope one generally must know which proteasome the source 
10 cells express. Proteasome expression can be assessed preferably by western blottmg, which is 
described in detail below, and can also be assessed by RT-PCR, hnmunohistochemistiy, or in situ 
hybridization. 

Examples. IFN induction test 

Another assay to distinguish between housekeepmg epitopes and immune epitopes is to 

15 test the ability of anti-peptide CTL to kiU cells expressmg the TAA m question. IFN can be used 
to mduce expression of the unmune proteasome (assummg it is not already constitutively 
expressed) and CTL recognition of the mduced and unmduced cells can be compared. As above, 
proteasome type should be confirmed, e.g., by western blotting. If the IFN-induced cells are killed 
preferentially, the peptide constitutes an immune epitope. If the non-mduced cells are killed 

20 preferentially, the peptide constitutes a housekeeping epitope. Some epitopes can be produced by 
both proteasomes at dififermg efficiencies, and in such cases cytolytic activity is observed against 
both populations. Such epitopes are classified as housekeeping epitopes since they are present on 
peripheral target cells. 

Example 9. The use of huma n peripheral blood mononuclear cells fPBMCs^ or tumor infiltratin g r 

25 lymphocyte s f TELs^ to identify housekeeping ep itopes 

TILs isolated from patient biopsies, or PBMCs from blood of donors or patients can be 
used to identify housekeeping epitopes using methods that are commonly described m the 
published literature. To identify housekeeping epitopes, the target cells used to test for active 
killing by PBMCs or TDLs are confirmed to express only the housekeeping proteasomes, and not to 

30 express at significant levels the immune proteasome. PBMCs from donor blood are stimulated in 
vitro using a panel of peptide antigens with predicted afRnity for the class I HLA allele expressed 
on the blood cells being used. Each PBMC sample is stimulated with a specific class I peptide 
antigen for one week, preferably with the combination of cytokines such as IL-2 or IL-12 to 
enhance the activity of the T cells. This stimulation is repeated at least three times to mduce clonal 

35 expansion of T cells specific agamst the peptide. A standard chromium release assay is performed 
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using target cells that are known to express the protein containing the epitope and exclusively the 
housekeeping proteasome. Evidence of killmg of Ihe target cells as measured by chromium release 
indicates that the peptide used to stimulate the PBMCs is present as a housekeeping epitope on the 
surface of the target cell. Tumors expressing this protein are thus candidate targets for a vaccine 
5 containing the epitope. 

Example 10. Identification of housekeepi ng and/or immune oroteasomes bv westem blotting 

Bolh of the following protocols start with a membrane onto which proteins extracted from 
cells of interest have been transferred after electrophoretic separation. 
A. Chromogenic protocoh 
10 1. Wash the mmibrane for 5 min in 20 ml PBS-T (phosphate buffered saline, pH 7.4 + 0.1% 

Tween-20) at room temperature on an orbital shaker (RT/shaker). 
PBS (Sigma, Cat No. P-3813) 
(Volumes may vary with type of container throughout). 

2. Incubate the membrane for 5 min in 20 ml PBS-T, 3% H^Oj at RT/shaker: 
15 2 ml 30%H2O2+l 8ml PBS-T 

3. Wash the membrane 3x5 min with PBS-T at RT/shaker. 

4. Block overnight in 20ml PBS-T/5% nonfat dry milk at 4^C/shaker: 
20ml PBS-T+lg milk 

5 . Rinse the membrane m PBS-T. 

20 6. Incubate the membrane m 5ml of primary antibody (AfSnily Research Products Ltd, 

United Kmgdom) in blocking buffer for 2hrs at RT/shaker: 

a-LMP 2 antiserum (mouse) (Cat. No JPW8205) i :5000 

a-LMP 2 antiserum (human) (Cat No. PW 8345) 1:10000 
a-LMP 7 antiserum (Cat No. PW 8200) 1 :20000 

25 ot-20S proteasome a2 subunit monoclonal antibody (Cat No. PW 8105) 1:1000 

These conditions are for the preceding antibodies only. Conditions for eveiy 
antibody must be determined empirically. 

7. Wash the membrane as in step 3. 

8. Incubate the membrane in 5 ml of secondaiy antibody (Vector Laboratories, Inc., 
30 Burlingame, CA) m blocking buffer for 30 min at RT/shaker: 

GARB (Goat anti Rabbit) (for antisera) (Vector Labs Cat No. BA-1000) 1 :2000 
Horse anti mouse (for monoclonal antibodies) (Vector Labs Cat No. BA-2000) 1:1000 

9. Wash the membrane as in step 3. 

10. Incubate the membrane in 5 ml of ABC (Vector Laboratories, Cat No. PK-6100) in PBS-T 
35 for 30 min: 
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Make ABC at least 30 min before using as follows: 

A=5ul/lml=25ul/5ml 

B=5ul/lml = 25ul/5ml 

5ul A+5ul B >mix:>let stand at 4 Oadd 990ul PBS-T 
5 DUute ABC in PBS-T just before using 

11. Wash the membrane as in step 3. 

12. Detection: 

1) transfer 5 ml of0.2MPB into a l^hs ml tube 
0.4M Phosphate buffer: 

10 90.4 ml of Sodium Phosphate Monobasic (IM) 

6 1 9.2 ml of Sodium Phosphate Dibasic (0.5M) 
pHto7.4 
QStolL 

2) transfer 2.8 ml of 0.2MPB into 32*^^15 ml tube 
15 3) transfer 2 ml of 1% Glucose into 3 3^^15 ml tube 

4) weigh 6mg of ANS (Ammonium Nickel Sulfate) and transfer it into 1^* 15ml tube; 
vortex 

5) add 110 1 of Glucose Oxidase (Sigma, Cat No.G-6891) into an eppendorf tube 

6) add 110 1 of DAB substrate (Diaminobenzidine HCl, KPL, Maryland Cat. No.71- 
20 00-46) into another eppendorf tube 

7) Mix in the hood: 5ml PB+ 2ml Glucose 

+1101GO 

+1101 DAB 

+ 2.8ml0.2MPB 

25 13. Apply detection mixture on the membrane and set up timer. Record length of incubation in 

chromogen. 

14. After bands became visible enough wash the membrane 3 tunes with 02M PB. 

1 5. Shake in PBS overnight at RT. 
B. Chemiluminescence protocol: 

30 1 . Rinse the membrane twice in TBS-T (Tris-buffered salme pH7.6 + 0.1% Tween-20). Tris- 

buffered saline: 2.42g Tris base (20mM) 

8g sodium chloride (l37mM) 
3 . 8ml IM hydrochloric acid 
2. Block overnight in 20ml of blocking buffer (TBSrT/5% nonfat dry milk) 
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4 **C/shaker: 
20ml TBS-T+lg milk 
Volumes depend on type of container 
3. Rinse the membrane twice with TBS-T. 
5 4, Incubate tfie membrane in 5ml of primary antibody (Affinity Research Products Ltd, 

United Kmgdom) in blockmg buffer for 2hrs at RT/shaken 

a-LMP 2 antiserum (mouse) (Cat NoJPW8205) 1 :5000 

a-LMP 2 antiserum (human) (Cat No. PW 8345) l : i OOOO 

a-IMP 7 antiserum (Cat No. PW 8200) 1 :20000 

10 a-20S proteasome a2 subunit monoclonal antibody (Cat No, PW 8105) 1 :1000 

5. Wash the membrane in 20ml of TBS-T at RT/shaker: 

Briefly rinse the membrane using two changes of TBS-T then wash once for 15 mmutes 
and twice for 5 minutes with fresh changes of the washmg buffer at room temperature. 

6. Incubate Ae membrane m 5 ml of HRP labeled (Horseradish peroxidase-labeled) 
15 secondaiy antibody (Amersham; Cat# NIF 824 or NIF 825) 1:1000 dilution in blocking 

buffer for Ih at RT/shaker 

7. Wash the membrane as in step 5. 

8. Mix an equal volume of detection solution 1 (Amersham, Cat#RPN2109) and detection 
solution 2 (Amersham, CatifRPN2 109) (Iml+lml). 

20 9. Drain the excess buffer from the washed membrane and put it on a piece of Saran Wrap, 

protein side up. Add the detection reagent to cover the membrane. 

10. Incubate for 1 minute at room temperature without agitation. 

11, Drain off excess of detection reagent and transfer the membrane to Kodak Digital Science 
Image Station 440CF protein side down. Develop and quantify the signal according to the 

25 manufacturers' instructions. 

The presence of housekeeping-specific subunits (in either protocol) is directly assessed 

using: 

a-61 (Y) subunit monoclonal antibody (Cat No. PW 8140) 1:1000 
a-B2 (Z) subunit monoclonal antibody (Cat No. PW 8145) 1:1000 

30 (AfSnity Research Products Ltd, United Kmgdom). 

Example 1 1 . Preparation of a housekeeping epitope peptide vaccine 

A sequence identijRed to be a housekeeping epitope is synthesized using a commercial 
peptide synthesizer. Peptides of mterest are formulated in different ways and administered alone, 
or m combination with adjuvants, such as CPA, IFA, or melacme, or with cytokmes, such as IL-2, 

35 IL-12, or GM-CSF in order to achieve the effect of stunulatmg T cells against the epitope in 
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animals. Peptides are also formulated with controlled release substances, such as PLGA 
microspheres or otiher biodegradable substances, which alter the pharmacokinetics of the peptide 
and can also improve immunogenicity. Peptides are also formulated for oral delivery using such 
substances to facilitate priming of the immune response through uptake into GALT (gut-associated 
5 lymphoid tissues). Peptide are also adhered to minute gold particles so that they can be delivered 
using a "gene gun." 

A. Synthesis of GMP-erade pep tides 

Peptides are synthesized using either FMOC or tBOC solid phase synthesis metiiodologies. 
After synthesis, the peptides are cleaved fcom their supports with either trifluoroacetic acid or 

10 hydrogen fluoride, respectively, in the presence of appropriate protective scavengers. After 
removmg the acid by evaporation, the peptides are extracted with ether to remove the scavengers 
and the crude, precipitated peptide is then lyophilized. Purity of the crude peptides is determined 
by HPLC, sequence analysis, amino acid analysis, counterion content analysis and other suitable 
means. If the crude peptides are pure enough (greater tfian or equal to about 90% pure), they can 

15 be used as is. If purification is required to meet drug substance specifications, the peptides are 
purified using one or a combination of the following: re-precipitation; reverse-phase, ion 
exchange, size exclusion or hydrophobic interaction chromatography; or counter-current 
distribution. 

B. Drug product formulation 

20 GMP-grade peptides are formulated in a parenterally acceptable aqueous, organic, or 

aqueous-organic buffer or solvent system in which tiiey remain both physically and chemically 
stable and biologically potent. Generally, buffers or combinations of buffers or combinations of 
buffers and organic solvents are appropriate. The pH range is typically between 6 and 9. Organic 
modifiers or other excipients can be added to help solubilize and stabilize the peptides. These 

25 include detergents, lipids, co-solvents, antioxidants, chelators and reducing agents. In the case of a 
lyophilized product, sucrose or mannitol or other lyophilization aids can be added. Peptide 
solutions are sterilized by membrane filtmtion into their final container-closure system and either 
lyophilized for dissolution in the clinic, or stored until use. 
Example 12. Deliverv of a housekeeping epitope peptide vaccine 

30 A. Intranodal deliverv 

A formulation containing peptide in aqueous buffer with an antimicrobial agent, an 
antioxidant, and an immunomodulating cytokine, was injected continuously over several days into 
the inguinal lymph node using a miniature pumping system developed for insulin delivery 
(MiniMed; Northridge, CA). This mfiision cycle was selected m order to mimic the kinetics of 

3 5 antigen presentation during a natural infection. 
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B. Controlled release 

5 A peptide fonnulation is delivered usmg controlled PLGA microspheres, which alter the 

pharmacokinetics of the peptide and improve unmunogenicity. This formulation is injected or 
taken orally. 

C> Gene gun delivery 

A peptide formulation is prepared wherein the peptide is adhered to gold microparticles. 
10 The particles are delivered in a gene gun, being accelerated at high speed so as to penetrate the 
skin, canying the particles into dermal tissues that contain pAPCs. 
D. Aerosol delivery 

A peptide formulation is inhaled as an aerosol, for uptake into appropriate vascular or 
lymphatic tissue in the lungs^ 
15 Example 13. Preparation of a nucleic acid vaccine 

A carrier plasmid vector, pVAXl (Invitrogen, Carlsbad, CA), containing a kanamycin 
resistance gene and a CMV promoter, was modified to include two sequences containing the 
desired epitopes. In addition it contamed an IRES sequence situated between two epitopes to allow 
their simultaneous expression using one promoter. A suitable E. coli strain was then transfected 
20 wiA the plasmid and plated out onto selective media. Several colonies were grown up in 
suspension culture and positive clones were identified by restriction mapping. The positive clone 
was then grown up and aliquotted into storage vials and stored at -TO^'C. 

A mini-prep (QIAprep Spin Mini-prep: Qiagen, Valencia, CA) of the plasmid was then 
made from a sample of these cells and automated fluorescent dideoxy sequence analysis was used 
25 to confirm that die construct had the desired sequence. Further nucleic acid vaccine vectors and 
formulations are described in the foregoing sections of this specification, and in Examples 18-20 
below. 

Example 14. Delivery of a nucleic acid vaccine 

A nucleic acid vaccine is injected into a lymph node using a miniature pumping system, 
30 such as the MiniMed insulin pump. A nucleic acid constructs formulated in an aqueotis buffered 
solution containing an antimicrobial agent, an antioxidant, and an immunomodulating cytokine, is 
delivered over a several day infusion cycle in order to mimic the kinetics of antigen presentation 
during a natural infection. 

Optionally, the nucleic acid construct is delivered using controlled release substances, such 
35 as PLGA microspheres or other biodegradable substances. These substances are injected or taken 
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orally. The nucleic acid vaccine is given using oral delivery, priming the immune response 
through uptake into GALT tissues. Alternatively, the nucleic acid vaccme is delivered using a 
gene gun, wherein the nucleic acid vaccine is adhered to minute gold particles. Nucleic acid 
constructs can also be inhaled as an aerosol, for uptake into appropriate vascular or lymphatic 
S tissue in the lungs. 

Example IS. Assay for presentation of housekeeping epitope on pAPCs 

A. Tetramer analy sis 

Class I tetramer analysis is used to determme T cell frequency in an animal before and 
after administration of a housekeepmg epitope. Clonal expansion of T cells in response to an 
10 epitope indicates that Ae epitope is presented to T ceUs by pAPCs. The specific T cell frequency 
is measured against the housekeeping epitope before and after administration of tfie epitope to an 
animal, to determine if the epitope is present on pAPCs. An increase in frequency of T ceUs 
specific to the epitope after administration indicates that the epitope was presented on pAPC. 

B. Proliferation assav 

15 Approximately 24 hours after vaccination of an animal with an housekeeping epitope, 

pAPCs are harvested from PBMCs, splenocytes, or lymph node cells, using monoclonal antibodies 
against specific markers present on pAPCs, fixed to magnetic beads for affinity purification. 
Crude blood or splenoctye preparation is enriched for pAPCs using this technique. The emiched 
pAPCs are tfien used in a proliferation assay against a T cell clone that has been generated and is 

20 specific for the housekeeping epitope of interest The pAPCs are coincubated with the T ceU clone 
and the T cells are monitored for proliferation activity by measuring the incorporation of. 
radiolabeled thymidme by T cells. Proliferation indicates that T cells specific for the housekeeping 
epitope are being stimulated by that epitope on the pAPCs. 

Example 16. Assav for effe ctiveness of housekeeping epitopes as anticancer treatment 
25 Surrogate endpoints or survival are used to determine the effectiveness of epitope 

synchronization vaccines m cancer treatment 
A. T cell freauencv analvsis 

A useful surrogate endpoint is the determination of T cell frequency against the 
housekeeping epitope used in immunization. Patients developing elevated T cell frequencies 
30 against specific TuAA epitopes used in tumor immunotherapy have significantly better survival 
compared to patients immunized by the same epitope but not developing increased T cell 
frequency to the epitope. Tetramer analysis, ELISPOT analysis, or limiting dilution analysis are 
used to assess T cell frequency to a housekeeping epitope before and after immunization with the 
epitope, indicating the anticancer effectiveness of a housekeeping epitope m a vaccme. 
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B. Tumor burd en/survival analysis 

An animal with an existing tumor is assessed for tumor burden before and after 
immunization with a housekeeping epitope. Partial or complete tumor regression indicates 
eflFective therapeutic intervention, and correlates with improved survival. In a laboratory setting, 
5 several animals are inoculated in parallel vwA a tumor. Some of the animals are then immunized 
Willi a housekeeping epitope vaccine. Survival of animals immunized with the housekeeping 
epitope is compared to those which received a control epitope or placebo, to determine 
effectiveness of the vaccine. 

C. Chromium release assay 

10 An animal genetically engmeered to express human class I MHC is unmunized using a 

housekeeping epitope. T cells from these animals are used in a standard chromium release assay 
using human tumor targets or targets engineered to express the same class I MHC. T cell killing of 
tiie targets indicates that stimulation of T cells in a patient would be eflfective at killing a tumor 
expressing a similar TuAA. 

15 Example 1 7: Identification of Unique Housekeeping Epitop es 

Epitopes useful in the vaccines and methods of the present invention can be readily 
identified as disclosed herein. For example, three unique housekeeping epitopes that are not 
produced by pAPCs have been identified as follows: 

A. Isolation and Purification 

Immune or housekeeping proteasome complexes are isolated. The purified peptide is 
dissolved in an appropriate buffer to a concentration of about 1 to 2 mM and added to 
q>proximately 2 volumes of the proteasome preparation. The buffer chosen must solvate the 
peptide without interfering with the digestion process. An additional digest is prepared using the 
positive control peptide described above to verify proper functioning of the proteasome preparation 
25 used. These are incubated at 37°C for periods of up to 120 minutes and then the digestion is 
stopped by the addition of dilute trifluoroacetic acid; the samples are analyzed immediately by 
mass spectrometry, or they are frozen on dry ice until analysis. The digest reaction can also be 
halted by puttmg samples on ice for immediate analysis by mass spectrometiy. 

B. MALDI-TQF mass spectrometric analvsis of the digest 

30 Approximately 0.5 ^1 of each digest was mixed with an equal volume of the matrix 

solution (10 mg/ml dihydroxybenzoic acid in 70% EtOH, pH 2-3) directly on the sample slide and 
allowed to air dry at about 40*'C. The samples were then analyzed on a Lasermat™ MALDI-TOF 
mass spectrometer that was calibrated with suitable molecular weight standards. 



20 
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Tlie computer program developed for the proteasome assay generates the sequence and 
molecular weight of all the possible fragments that satisfy both requirements of having the correct 
C-termimis of any predicted epitope, and of containing the full length of that epitope or longer. 

When the MALDI-TOF results showed that a particular molecular weight was represented 
5 in a digestion mfacture, the corresponding peptide was synthesized, purified, identified by MALDI- 
TOF and then subjected to reverse phase analytical HPLC to establish a standard retention time 
and an approximate mass to peak area ratio. These procedures are directly analogous to those 
described above. A replicate proteasome digest was then diluted in an appropriate solvent and 
analyzed using the same analytical HPLC method. When the digest gives a peak in good yield that 

10 V has the same retention time as that of the standard, it is ahnost certain that it is due to the presence 
of that sequence in the digest. When there is any ambiguity due to the possible generation of other 
fragments that would give rise to the same or similar mass spectrometry results, the suspect 
component can be collected and set aside for sequencing to confirm identity. Using the above 
method, housekeeping epitopes were identified. 

15 C. Additional Unique Housekeeping Epitope 

Binding of a candidate epitope to HLA-A2.1 was assayed according to the method of 
Stauss et al., (Proc Natl Acad Sci USA 89(17):7871-5 (1992)). T2 cells, which express empty or 
unstable MHC molecules on their surface, were washed twice and suspended at 5x10^ cells/ml in . 
serum-free complete Iscove's modified Dulbecco's medium (IMDM). 62 microglobulin (Sigma, 

20 St. Louis, MO) was added at S^ig/ml and the cells distributed to a 96-well U-bottom plate at 5x10^ 
cells/well. Peptides were added at 100, 10, 1 and Cljig/mL The plate was rocked gently for 2 
minutes and then incubated for 4 hours in a 5% CO2 incubator at 37**C. After flie unbound peptide 
was removed by washing twice with IMDM, a saturatmg amount of monoclonal antibody W6/32 
(Sigma) was added. After incubation for 30 minutes at 4°C, cells were washed with PBS 

25 supplemented with 1% heat-inactivated PCS, 0.1%(w:v) sodium azide, pH 7.4-7.6 (staining 
buffer), and incubated with fluorescein isothiocyanate (FITC>conjugated goat F(ab') antimouse- 
IgG (Sigma) for 30 min at 4^C and washed four times as before. The cells were resuspended in 
staim'ng buffer and fixed by adding a quarter volume of 2% paraformaldehyde. The analysis of 
surface HLA-A2.1 molecules stabilized by peptide binding was performed by flow cytometry 

30 using a FACScan (Becton Dickinson, San Jose, CA). 

Using the method discussed above, a candidate tyrosinase housekeeping epitope identified 
by proteasomal digestion, (tyrosinase 207-216, FLPWHRLFLL SEQ ID NO: 78) was found to bind 
HLA-A2.1 to a similar extent as the known A2.1 binder FLPSDYFPSV (SEQ ID NO: 79) (positive 
control). HLA-B44 bindmg peptide AEMGKYSFY (SEQ ID NO: 80) used as a negative control. 

35 The fluoresence obtained firom the negative control was similar to the signal obtained when no 
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peptide was used in the assay. Positive and negative control peptides were chosen from Table 
18.3.1 in Current Protocols in Immunology p. 18.3.2, John Wiley and Sons, New York, 1998. 



Example 18. Construction of pVAX-EP1>IRES»-EP2 
Overview: 

The starting plasmid for this construct is pVAXl purchased from Invitrogen.(Carlsbad, 
10 CA) Epitope EPl and EP2 were synthesized by GIBCO BRL (Rockville, MD). IRES was cut out 
from pIRES purchased from Clontech (Palo Alto, CA). See Figure 12. 
Procedure: 



1 Digest pIRES with EcoRI and NoH. Separate the digested fragments with agarose gel, and 
purify the IRES fragment by gel purification; 
15 2 Digest pVAXl with EcoRI and Notl. Gel-purify the pVAXl fragment; 

3 Set up a ligation containing the purified pVAXl and IRES fragment 

4 Transform competent DH5a with the ligation mixture; 

5 Pick up 4 colonies and make a miniprep. 

6 Perform restriction enzyme digestion analysis of the miniprep DNA. One recombmant . 
20 colony having the IRES insert was used for further insertion of EPl and EP2. This 

intermediate construct was called pVAX-IRES. 

7 Synthesize EPl and EP2; 

8 Subclone EPl into pVAX-IRES between AfUI and EcoRI site, to make pVAX-EPl- 
IRES; 

25 9 Subclone EP2 into pVAX-EPl-IRES between Sail and Notl, to make the final 

construct pVAX-EPl-IRES-EP2; . 
10 Sequence the EP1-IRES-EP2 insert to confirm sequence. 

Example 19. Construction of pVAX-EP1-IRES-EP2-ISS-NIS 
Overview: 

30 The starting plasmid for this construct was pVAX-EPl-nUES-EP2 (Example 18). ISS 



(immunostimulatory sequence) (SEQ ID NO: 82) introduced to this construct is AACGTT, and the 
NIS (standing for nuclear import sequence) (SEQ ID NO: 81) used is the SV40 72bp repeat 
sequence. ISS-NIS was synthesized by GIBCO BRL. See Figure 9. 
Procedure: 

35 1 Digest pVAX-EPl-IRES-EP2 wilh Nrul. Gel-purify tiie linearized plasmid; 
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2 Synthesize ISS-NIS; 

3 Set up a ligation reaction containing the purified linearized pVAX-EPl-IRES-EP2 and 
synthesized ISS-NIS; 

4 Transform competent DH5a with the ligation product; 
5 5 Pick up colonies and make a miniprep; 

6 Cany out restriction enzyme digestion of the miniprep; 

7 Sequence the plasmid with the msert. 



Example 20. Construction of pVAX-EP2-UB->EPl 
Overview: 

10 The starting plasmid for this construct is pVAXl (Invitrogen). EP2 and EPl were 

synthesized by GIBCO BRL. Wild type Ubiquitin cDNA encoding the 76 amino acids in the 
construct was cloned firomyeasL See Figure 10. 
Procedure: 

1 Perform RT-PCR using yeast mKNA. Primers were designed to amplify the complete 
15 coding sequence of yeast Ubiquitin; 

2 Analyze the RT-PCR products usmg agarose gel. Gel-purify the band with the predicted 

size; 

3 Subclone the purified DNA band into pZEROl at EcoRV site. The resulting clone was . 
named pZERO-UB; 

20 4 Sequence several clones of pZERO-UB. Confirm Ubiquitin sequence before further 

manipulations; 

5 Synthesize EPl and EP2; 

6 Ligate EP2, Ubiquitin and EPl and clone the insert into pVAXl between BamHI and 
EcoRI, making it under control of the CMV promoter; 

25 7 Confirm the sequence of the insert EP2-UB-EP1 by sequencing. 
IRES fSEOIDNO: 6^ 

AATTCCGCCCCTCTCCGTCCCCCCCCCCTAACGTTACTGGCCGAAGCCGCTTGGAATAA 
GGCCGGTGTGCGTTTGTCTATATGTGATTTTCCACCATATTGCCGTCTTTTGGCAATGTG 
AGGGCCCGGAAACCTGGCCCTGTCTTCTTGACGAGCATTCCTAQGGGTCTTTCCCCTCT 
30 CGCCAAAGGAATGCAAGGTCTGTTGAATGTCGTGAAGGAAOCAGTTCCTCTGGAAGCT 
TCTTGAAGACAAACAACGTCTGTAGCGACCCTTTGCAGGCAGCGGAACCCCCCACCTC 
GCGACAGGTGCCTCTGCGGCCAAAAGCCACGTGTATAAGATACACCTGCAAAGGCGGC 
ACAACCCCAGTGCCACGTTGTGAGTTGGATAGTTGTGGAAAGAGTCAAATGGCTCTCC 
TCAAGCGTATTCAACAAGGGGCTGAAGGATGCCCAGAAGGTACCCCATTGTATGGGAT 
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CTGATCTGGGGCCTCGGTGCACATGCTTTACATGTGTTTAGTCGAGGTTAAA^ 
CTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTTGAAAAACACGATGAT^ 
Reference: Clontech PT3266-5 

5 UBIOinilN f SEP ID NO: S> 

ATGCAGATTTTCGTCAAGACTTTGACCGGTAAAACCATAACATO 
CGATACCATCGACAACGTTAAGTCGAAAATTCAAGACAAGGAAGGTATCCCTCCAGAT 
CAACAAAGATTGATCTTTGCCGGTAAGCAGCTAGAAGACGGTAGAACGCTGrc^ 
ACAACATTCAGAAGGAGTCCACCTTACATOTGTGCTAAGGCTAAGAGGTGGC 
10 Reference: Ozkaynak, E., Finley, D., Solomon, MJ. and Varshavsky, A., The yeast ubiquMn 
genes: a fanoiily of natural gene fusions. EMBO J. 6 (5), 1429-1439 (1987). 

NIS fSEOroNQ:8n 

TGGTTGCTGACTAATTGAGATGCATGCTTTGCATACTTCTGCCTGCTGGGGAC^^ 
15 GACTTTCCACACC 

Reference: Dean DA, Dean BS, MuUer S, Smith LC, Sequence requirements for plasmid nuclear 
import. Exp. Cell Res. 253 (2): 713-22 (1999). 

ISS (SEP ID NO: 82-1 
20 AACGTT 

Reference: Sato Y, Roman M, Ti^e H, Lee D, Corr M, Nguyen M, Silvennan GJ, Lotz M, 
Carson DA and Raz E, Immunostimulatory DNA sequences necessary for effective intradermal 
gene immunization. Science, 273: 352-354 (1996). 

25 Examples 21-24 all concern the prediction of 9-mer epitopes presented by HLA-A2.1, 

although the procedure is equally applicable to any HLA type, or epitope length, for which a 
predictive algorithm or MHC binding assay is available. 
Example 21. Melan-A(MART-1 rSBGIDNP: n 

This melanoma tumor-associated antigen (TAA) is 1 18 amino acids in lengfli. Of the 110 

30 possible 9-mers, 16 are given a score 16 by the SYFPEITHI/Rammensee algorithm. (See Table 8). 
These represent 14.5% of the possible peptides and an average epitope density on the protem of 
0.136 per amino acid. Twelve of these overlap, covering amino acids 22-49 of SEQ ID NO: 1 
resulting in an epitope density for the cluster of 0.428, giving a ratio, as described above, of 3.15. 
Another two predicted epitopes overlap amino acids 56-69 of SEQ ID NO: 1, giving an epitope 
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density for the cluster of 0.143, which is not appreciably different than the average, with a ratio of 
just 1.05. See Figure 15. 



Table 8 

5 SYFPEITHI rRammensee algorithm^ Results for Melan-A/MART- 1 (SEP ID NO: 1) 



Rank 


Start 


Score 


1 


31 


27 


2 


56 


26 


3 


35 


26 


4 


, 32 


25 


5 


27 


25 


6 


29 


24 


7 


34 


23 


8 


61 


20 


9 


33 


19 


10 


22 


19 


11 


99 


18 


12 


36 


18 


13 


28 


18 


14 


87 


17 


15 


41 


17 


16 


40 


16 



Restricting the analysis to the 9-mere predicted to have a half time of dissociation of 5 
minutes by the BIMAS-imi/Paiker algorithm leaves only 5. (See Table 9). The average density 
10 of epitopes in the protein is now only 0.042 per amino acid. Three overlapping peptides cover 
amino acids 3 1-48 of SEQ ID NO: 1 and ttie other two cover 56-69 of SEQ ID NO: 1, as before, 
givmg ratios of 3,93 and 3.40, respectively. (See Table 10). 



Table 9 

15 BIMAS-Nm/Parker algorit hm Rftsiiltg f or Melan-A/lVlART-1 f SEP ID NO: n 



Rank 


Start 


Score 


Log(Score) 


1 


40 


1289.01 


3.11 


2 


56 


1055.104 


3.02 


3 


31 


81.385 


1.91 


4 


35 


20.753 


132 


5 


61 


4.968 


0.70 



4 
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Table 10 

Predicted Epitope Clusters for Melan-A/MART-I (SEP ID NO: 1 > 







CalculationsfEDitopes/AAs't 






Cluster 


AA 


Peptides Cluster 


Whole protein 


Ratio 


1 


31-48 


3,4,1 0.17 


0.042 


3.93 


2 


56-69 


2,5 0.14 


0.042 


3.40 



5 Example 22. SSX-2/HOM-MEL40 (SEP ID NO: 2^ 

This melanoma tumor-associated antigen (TAA) is 188 amino acids in length. Of the 180 
possible 9-mers, 11 are given a score 16 by the SYFPEITHI/Rammensee algorithm. (See Table 
11). These represent 6.1% of tiie possible peptides and an average epitope density on the protein of 
0.059 per amino acid. Three of these overlap, covering amino acids 99-114 of SEQ ID NO: 2 

10 resulting in an epitope density for the cluster of 0,188, giving a ratio, as described above, of 3.18. 
There are also overlapping pairs of predicted epitopes at amino acids 16-28, 57-67, and 167-183 of 
SEQ ID NO: 2, giving ratios of 2.63, 3.11, and 2.01, respectively. There is an additional predicted 
epitope chister covering amino acids 5-28 of SEQ ID NO: 2, Evaluating the region 5-28 of SEQ 
ID NO: 2 containing three epitopes gives an epitope density of 0.125 and a ratio 2.14. (See 

15 Table 12). 

Restricting tiie analysis to the 9-mers predicted to have a half time of dissociation of 5. 
minutes by the BIMAS-NIH/Parker algorithm leaves only 6. (See Table 13). The average density 
of epitopes in the protein is now only 0.032 per amino acid. Only a single pair overiap, at 1 67-1 80 
of SEQ ID NO: 2, with a ratio of 4,48. However tiie top ranked peptide is close to another single 
20 predicted epitope if that region, amino acids 41-65 of SEQ ID NO: 2, is evaluated the ratio is 2.5 1, 
representing a substantial difference from the average. (See Figure 16 and Table 14). 

Table 11 . 

SYFPEITHI/Rammensee algorithm for SSX-2/HOM-MEL-40 (SEO ID NO: 2^ 

25 



Rank 


Start 


Score 


1 


103 


23 


2 


167 


22 


3 


41 


22 


4 


16 


21 


5 


99 


20 


6 


59 


19 


7 


20 


17 


8 


5 


17 


9 


175 


16 


10 


106 


16 


11 


57 


16 
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Table 12 

CalculationsflBpitopes/AAs) rSEO ID NO: 2^ 







Calculations^Ditoises/AAs) 






Cluster 


AA 


Peptides 


Cluster 


Whole protein 


Ratio 


1 


Sto28 


8,4,7 


0.125 


0.059 


2.14 


2 


16-28 


4,7 


0.15 


0.059 


2.63 


3 


57-67 


11,6 


0.18 


0.059 


3.11 


4 


99-114 


5, 1, 10 


0.19 


0.059 


3.20 


5 


167-183 


2,9 


0.12 


0.059 


2.01 



5 

Table 13 

BIMAS-NimParicer algorithm (SEP ID NO: 2^ 



Sank 


Start 


Score 


Log(Score) 


1 


41 


1017.062 


3.01 


2 


167 


21.672 


1.34 


3 


57 


20.81 


1.32 


4 


103 


10.433 


1.02 


5 


172 


10.068 


1.00 


6 


16 


6.442 


0.81 



10 

Table 14 

CalculationsffioitoDes/AAs^ f SEP ID NO: 2'> 



Cluster 


AA 


Peptides 


Cluster 


Whole protein 


Ratio 


1 


41-65 


1,3 


0.08 


0.032 


2.51 . 


2 


167-180 


2.5 


0.14 


0.032 


4.48 



15 Example 23, NY-ESQ (SEP ID NO: 3^ 

This tumor-associated antigen (TAA) is 180 amino acids in length. Of the 172 possible 9- 
mers, 25 are given a score 16 by the SYFPEITHI/Rammensee algorithm. (See Table 15). Like 
Melan-A above, these represent 14.5% of the possible peptides and an average epitope density on 
the protein of 0.136 per amino acid. However the distribution is quite diSerent Nearly half the 

20 protein is empty with just one predicted epitope in the first 78 amino acids. Unlike Melan- A where 
there was a very tight cluster of highly overlapping peptides, in NY-ESO the overlaps are smaller 
and extend over most of the rest of the protein. One set of 19 overlapping peptides covers amino 
acids 108-174 of SEQ ID NO: 3, resulting in a ratio of 2.04. Another 5 predicted epitopes cover 
79-104 of SEQ ID NO: 3, for a ratio of just 1.38. (See Table 16). 

25 If instead one takes the approach of considering only the top 5% of predicted epitopes, in 

this case 9 peptides, one can examine whedier good clusters are being obscured by peptides 
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predicted to be less likely to bind to MHC. When just these predicted epitopes are considered we 
see that the region 108-140 of SEQ ID NO: 3 contains 6 overlapping peptides with a ratio of 3.64. 
There are also 2 nearby peptides in the region 148-167 of SEQ ID NO: 3 with a ratio of 2.00. Thus 
the large cluster 108-174 of SEQ ID NO: 3 can be broken into two smaller clusters covering much 
5 of die same sequence. (See Table 16). 

Bestricting the analysis to the 9-mers predicted to have a half tune of dissociation of 5 
minutes by the BIMAS-NIH/Parker algorithm brings 14 peptides into consideration. (See Table 
17). The average density of epitopes in the protein is now 0.078 per amino acid. A smgle set of 10 
overlapping peptides is observed, covering ammo acids 144-171 of SEQ ID NO: 3, with a ratio of 
10 4.59. Ail 14 peptides fell m the region 86-171 of SEQ ID NO: 3 which is still 2.09 times the 
average density of epitopes m the protein. While such a large cluster is larger than we consider 
ideal it still offers a significant advantage over woricing with the whole protein. (See Figure 17 and 



Table 18). 



15 



Table IS 

SYFPEITHI rRammensee algorithm^ Results for NY-ESO TSEO ID NO: 3^ 



Rank 



Start 



Score 



1 

2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 



108 

148 
159 
127 
86 
132 
122 
120 
115 
96 
113 
91 
166 
161 
157 
151 
137 
79 
.139 
131 
87 
152 
144 
129 
15 



25 
24 
21 
21 
21 
20 
20 
20 
20 
20 
19 
19 
18 
18 
18 
18 
18 
18 
17 
17 
17 
16 
16 
16 
16 



-78- 



wo 01/82963 PCTAJSOl/13806 



Table 16 

Calculationsf EoitODes/AAs^ (SEP TP NO: 3^ 





AA 

An 




Cluster 


Whole protein 


Ratio 


1 


108- 
140 


1,9,8.7,4,6 


0.18 


0.05 


3.64 


2 


148- 
167 


2,3 


0.10 


0.05 


2.00 


3 


79- 
104 


512,10,18,21 


0.19 


0.14 


138 


4 


108- 

174 


1,11,9,8,7,4,6,17,2,16, 15, 3, 14, 

13, 24, 20, 19, 23, 22 


0.28 


0.14 


2.04 



5 

Table 17 

BIMAS-NIH/Paifcer algorithm Results forNY-ESO fSEO ID NO: 3^ 



Rank 


Start 


Score 


Log(Score) 


1 


159 


1197321 


3.08 


2 


86 


429.578 


2.63 


3 


120 


130.601 


2.12 


4 


161 


83.584 


1.92 


5 


155 


52.704 


1.72 


6 


154 


49.509 


1.69 


7 


157 


42.278 


1.63 


8 


108 


21.362 


1.33 


9 


132 


19.425 


1.29 


10 


145 


13.624 


1.13 


11 


163 


11.913 


1.08 


12 


144 


11.426 


1.06 


13 


148 


6.756 


0.83 


14 


152 


4.968 


0.70 



10 

Table 18 

CalculatiopsfiBoitopes/AAs') f SEP ID NO: 3'> 



Cluster 


AA 


Peptides 


Cluster 


Whole protein 


Ratio 


1 


86-171 


2,8,3,9,10,12,13, 14,6,5,7,1,4, 

11 

10, 12,13, 14,6,5,7,1,4,11 


0.163 


0.078 


2.09 


2 


144- 
171 


0.36 


0.078 


4.59 



15 Example 24. Tyrosinase (SEQ ID NO: 4) 

This melanoma tumor-associated antigen (TAA) is 529 amino acids in length. Of the 521 
possible 9-mers, 52 are given a score 16 by the SYFPEITHI/Rammensee algorithm. (See Table 
19). These represent 10% of the possible peptides and an average epitope density on tiie protein of 
0.098 per amino acid. There are 5 groups of overl2q)ping peptides containing 2 to 13 predicted 
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epitopes each, with ratios ranging from 2.03 to 4.41, respectively. There are an additional 7 groups 
of overlapping peptides, containmg 2 to 4 predicted epitopes each, with ratios ranging from 1.20 to 
1.85, respectively. The 17 peptides in the region 444-506 of SEQ ID NO: 4, including the 13 
overlapping peptides above, constitutes a cluster with a ratio of 2.20. (See Table 20). 

Restricting the analysis to the 9-mers predicted to have a half tune of dissociation of 5 
minutes by the BIMAS-NIH/Parker algorithm brings 28 peptides into consideration. (See Table 
21), The average density of epitopes in the protem imder this condition is 0.053 per amino acid. At 
this density any overlap represents more than twice the average density of epitopes. There are 5 
groups of overlapping peptides containing 2 to 7 predicted epitopes each, with ratios ranging from 
2.22 to 4.9, respectively. (See Table 22). Only three of these clusters are common to the two 
algorithms. Several, but not all, of these clusters could be enlarged by evaluating a region 
contaming them and nearby predicted epitopes. See Figure 18. 
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Table 19 

SYFPEITHI/Rammensee a1p ;nrithm Results for TymsinasR (SEQ ID NO: 4) 



Rank 


Start 


Score 




Sank 


Stort 


Score 


1 


490 


34 




27 


5 


19 


2 


491 


31 




28 


484 


18 


3 


487 


28 




29 


476 


18 


4 


1 


27 




30 


463 


18 


5 , 


2 


25 




31 


444 


18 


6 


482 


23 




32 


425 


18 


7 


380 


23 




33 


316 


18 


8 


369 


23 




34 


187 


18 


9 


214 


23 




35 


402 


17 


10 


506 


22 




36 


388 


17 


11 


343 


22 




37 


346 


17 


12 


207 


22 




38 


336 


17 


13 


137 


22 




39 


225 


i7 


14 


57 


22 




40 


224 


17 


15 


169 


20 




41 


208 


17 


16 


118 


20 




42 


186 


17 


17 


9 


20 




43 


171 


17 


18 


488 


19 




44 


514 


16 


19 


483 


19 




45 


494 


16 


20 


480 


19 




46 


406 


16 


21 


479 


19 




47 


385 


16 


22 


478 


19 




48 


349 


16 


23 


473 


19 




49 


m 


16 


24 


365 


19 




50 


167 


16 


25 


287 


19 




51 


145 


16 


26 


200 


19 




52 


139 


16 



-80- 



wo 01/82963 



PCTAJSOl/13806 



Table 20 

CalculationsrEDitDPes/AAs^ (SEQ ID NO: 4) 



Cluster AA 



Peptides 



Cluster Whole protein Ratio 



3 
4 



167-179 
184-195 

224-233 



4,5,27,17 



15, 43, 50 
34 42 49 

39, 40 



0.24^ ^0.098 2.39 

0.23 0.098 2.35 

0.25 0.098 2.54 

0.20 0.098 2.03 



11 



473-502 




29,28,23.22,21,20,6,19, 



0.43 



0.098 



444-522 3 1, 30, 23, 29, 22, 21, 20, 6, 
19,28,3,18.1,2,45,10,44 



0.22 



0.098 



4.41 



2.20 



Table 21 

BIMAS-I<nH/Paricer algoritiun Results (SEQ ID NO: 4) 



Rank 


Stftrt 


Score 


Log(Score) 


1 


207 


540.469 


2.73 


2 


369 


531.455 


2.73 


3 


1 


309.05 


2.49 


4 


9 


266.374 


2.43 


5 


490 


181.794 


2.26 


6 


214 


177.566 


2.25 


7 


224 


143.451 


2.16 


8 


171 


93.656 


i.97 


9 


506 


87.586 


1.94 


10 


487 


83.527 


1.92 


11 


491 


83.527 


1.92 


12 


2 


54.474 


1.74 


13 


137 


47.991 


1.68 


14 


200 


30.777 


1.49 


15 


208 


26.248 


1.42 


16 


460 


21.919 


1.34 


■ 17 


478 


19.425 


1.29 


18 


365 


17.14 


1J23 


19 


380 


16.228 


1.21 


20 


444 


13.218 


1.12 


21 


473 


13.04 


1.12 


22 


57 


10.868 


1.04 


23 


482 


8.252 


0.92 


24 


483 


7.309 


0.86 


25 


5 


6.993 


0.84 


26 


225 


5.858 


0.77 


27 


343 


5.195 


0.72 


28 


514 


5.179 


0.71 
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Table 22 

CalcttlatjonsflBpitopes/AAs) (SEQ ID NO: 4) 



Cluster 


AA 


Peptides 


Cluster 


Whole protein 


Ratio 


1 


ltol7 


3, 12,25,4 


0.24 


0.053 


4.45 


2 


200-222 


14. 1, 15, 6 


0.17 


0.053 


3.29 


3 


224-233 


7,26 


0.20 


0.053 


3.78 


4 


365-377 


18,2 


0.15 


0.053 


2.91 


5 


473-499 


21, 17,23,24,10,5,11 


0.26 


0.053 


4.90 


6 


506-522 


9,28 


0.12 


0.053 


2.22 


7 


365-388 


18,2, 19 


0.13 


0.053 


2.36 


8 


444-499 


20,16,21, 17,23,24.10,5,11 


0.16 


0.053 


3.03 


9 


444-522 


20, 16, 21, 17, 23, 24, 10. 5, 1 1, 9, 28 


0.14 


0.053 


2.63 


10 


200-233 


14, 1, 15, 6. 7, 26 


0.18 


0.053 


3.33 
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WHAT IS CLAIMED TS : 

1. A vaccine comprising a first housekeeping epitope, wherein the housekeeping epitope 
is derived from a first antigen associated with a first target cell. 

2. The vaccine of claim 1 comprising a nucleic acid construct, wherein the construct 
5 encodes the housekeeping epitope. 

3. The vaccine of claun 1 or 2, wherem the target cell is a neoplastic cell. 

4. The vaccine of clahn 3, wherem the neoplastic cell is selected from the group 
consisting of leukemia, carcinoma, lymphoma, astrocytoma, sarcoma, glioma, retinoblastoma, 
melanoma, Wilm's tumor, bladder cancer, breast cancer, colon cancer, hepatocellular cancer, 

10 pancreatic cancer, prostate cancer, lung cancer, liver cancer, stomach cancer, cervical cancer, 
testicular cancer, renal cell cancer, and brain cancer. 

5. He vaccine of claim 3, wherein the antigen is selected from the group consistmg of 
MelanA (MART-I), gplOO (Pmel 17), tyrosinase, TRP-1, TRP-2, MAGE-1, MAGE-3, BAGE, 
GAGE-1, GAGE-2, CEA, RAGE, NY-ESO, SCP-1, Hom/MeMO, FRAME, p53, H-Ras, HER- 

15 2/neu, BCR-ABL, E2A-PRL, H4-RET, IGH-IGK, MYL-RAR, Epstein Barr virus antigens, EBNA, 
human papillomavirus (HPV) antigens E6 and E7, TSP-180, MAGE-4, MAGE-5, MAGE-6, 
pl85erbB2, pl80erbB-3, c-met, mn-23Hl, PSA, TAG-72.4, CAM 17.1, NuMa, K-ras, p-Catenin, 
CDK4, Mum-1, pl6, TAGE, PSMA, PSCA, CT7, telomerase, 43-9F, 5T4, 791Tgp72, alpha- 
fetoprotein , B-HCG, BCA225, BTAA, CA 125, CA 15-3 (CA 27.29\BCAA), CA 195, CA 242, 

20 CA-50, CAM43, CD68\KP1, CO-029, FGF-5, G250, Ga733 (EpCAM), HTgp-175, M344, MA-50, 
MG7-Ag, MOV18, NB/70K, NY-COl, RCASl, SDCCAG16, TA-90 {Mac-2 binding 
proteinXcyclophilm C-associated protein), TAAL6, TAG72, TLP, TPS, and GA733-2\KSA. 

6. The vaccine of claim 1 or 2, wherein the target cell is infected by an intracellular 
parasite. 

25 7. The vaccine of claim 6, wherein the intracellular parasite is a virus. 

8. The vaccine of claim 7, wherein the virus is selected from the group consisting of: 
adenovirus, cytomegalovirus, Epstein-Barr virus, herpes simplex virus 1, herpes simplex virus 2, 
human herpesvirus 6, varicella-zoster virus, hepatitis B virus, hepatitis D virus, papilloma vnus, 
parvovirus B19, polyomavirus BK, polyomavirus JC, hepatitis C virus, measles virus, rubella 

30 virus, human immunodeficiency virus (HIV), human Tcell leukemia virus I, and human Tcell 
leukemia virus U. 

9. The vaccine of claim 7, wherein the antigen is a virus-associated antigen. 

10. The vaccine of claim 6, wherein the intracellular parasite is a bacterium, a protozoan, a 
fimgus, or a prion. 
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11. The vaccine of claim 10, wherein the intracellular parasite is selected from the group 
consisting of Chlamydia, Listeria, Salmonella, Legionella, Brucella, Coxiella, Rickettsia, 
Mycobacteritmt, Leishmania, Trypanosoma, Toxoplasma, mi Plasmodium. 

12. The vaccine of claim 10, wherein the antigen is parasite-associated antigen. 

5 13. The vaccine of claim 1 or 2, wherein the housekeeping epitope comprises or encodes a 

polypeptide of about 6 to about 23 amino acids m length. 

14. The vaccine of claim 13, the polypeptide being 9 or 10 amino acids in length. 

15. The vaccine of claim 1, wherein the housekeeping epitope comprises a nucleic acid. 

16. The vaccine of claim 1, wherein the housekeeping epitope comprises a synthetic 
10 polypeptide. 

17. The vaccine of any of the preceding claims wherein the vaccine further comprises at 
least one component selected from the group consisting of buffers, detergents, surfactants, anti- 
oxidants, or reducing agents. 

18. The vaccine of any of the preceding claims, wherein the housekeeping epitope is 
1 S specific for at least one allele of MHC. 

19. The vaccine of claim 18, whereia the allele encodes type A2. 

20. The vaccine of claim 1 8, wherein the allele encodes a type selected from the group 
consisting of Al , A3, Al 1 , and A24. 

21. The vaccine of claim 18, wherein the allele encodes a type selected from the group 
20 consisting of A26, A29, B7, B8, B14, B18, B27, B35, B44, B62, B60, and B5L 

22. The vaccine of any of tiie preceding claims, further comprising an hnmune epitope. 

23. The vaccine of clahn 22, wherein the immune epitope is derived from a second antigen 
associated with the target cell. 

24. The vaccine of claim 21, wherein the first antigen and the second antigen are the same. 
25 25. The vaccine of claim 22, 23, or 24 wherein the housekeeping epitope is specific for a 

first allele of MHC, and wherein the immune epitope is specific for a second allele of MHC. 

26. The vaccine of claim 25, wherein the first allele and the second allele are the same. 

27. Tlie vaccine of claim 25, wherein the first allele and the second allele are not the same. 

28. The vaccine of any of clauns 22-27, wherein the vaccine comprises an epitope cluster, 
30 and wherein the epitope cluster comprises the immune epitope. 

29. The vaccine of claim 28, wherein the epitope cluster comprises or encodes a 
polypeptide having a length, wherein the length is at least 10 amino acids. 

30. The vaccine of claim 29, wherein the length of the polypeptide is less than about 60 
amino acids. 
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31. The vaccine of claim 28, wherein the epitope cluster is derived from a second antigen 
associated with the target cell. 

32. The vaccine of claim 3 1, wherein the &st antigen and the second antigen are the same. 

33. The vaccine of claim 31 or 32, the second antigen having a length, wherein the epitope 
5 cluster consists of or encodes a polypeptide having a length, wherein the length of the polypeptide 

is less than about 80% of the length of the second antigen. 

34. The vaccine of claim 33, wherein the length of the polypeptide is less than about 50% 
of the length of the second antigen. 

35. The vaccine of claim 34, wherein the length of the polypeptide is less than about 20% 
10 of the length of the second antigen. 

36. The vaccine of any of the preceding claims, further comprising a second housekeepmg 
epitope, wherein the second housekeeping epitope is derived 6om a second antigen associated with 
a second target cell. 

37. The vaccine of claim 36, wherein the first antigen and the second antigen are the same. 
15 38. The vaccine of claim 36, wherein the first antigen and the second antigen are not the 

same.. 

39. The vaccine of claim 36, wherein the first target cell and the second target cell are the 

same. 

40. Hie vaccine of claim 36, wherein the first target cell and the second target cell are not 
20 the same. 

41. A method of making a vaccine, comprising the steps of: 

selecting a first housekeeping epitope by identifying polypeptide fragments that 
are or could be produced by a housekeeping proteasome from a first antigen associated 
with a first target cell or target pathogen; and 
25 preparing a vaccine composition comprising or encoding the selected 

housekeeping epitope. 

42. An isolated T cell expressing a T cell receptor specific for an MHC-peptide complex 
comprismg a first housekeeping epitope, wherem the housekeeping epitope is derived from a first 
antigen associated with a first target cell. 

30 43. A T cell clone comprising the T cell of claim 42. 

44. A polyclonal population of T cells comprising the T cell of claim 42. 

45. The T cell of claim 42 produced by an in vitro immunization. 

46. The T cell of claim 42 isolated from an immunized animal. 

47. A method of making an adoptive immunotherapeutic, comprising: 
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combining the T cell of any of claims 42-46 with a pharmaceutically acceptable 
adjuvant, carrier, diluent, or excipient 

48. The us© of flie T cell of any of claims 42-46 in the manufacture of a medicament for 
use in adoptive inmiunotherapy. 
5 49. A method of epitope discovery comprising the step of selecting an epitope from a 

population of peptide fragments of an antigen associated with a target in a host, wherein the 
fragments have a known or predicted afiinity for a major histocompatibility complex receptor 
peptide binding cleft of the host, wherein the epitope selected corresponds to a product of 
proteolytic cleavage of the antigen in a cell of the host. 
10 50. The method of claim 49 wherein the target is a target cell, wherein the major 

histocompatibility complex receptor is a class I major histocompatibility complex receptor, 
wherein the product of proteolytic cleavage is a proteasome cleavage product, and wherein the cell 
of the host is the target cell. 

51. The method of claim 49 wherein the major histocompatibili^ complex receptor is a 
1 5 class n major histocompatibility complex receptor. 

52. The method of claim 51, wherein the target is selected &om the group consisting of a 
neoplastic cell, a pathogen, a toxin, a venom, and an allergen. 

53. A method of discovering an epitope comprising the steps of: 

providing a sequence from a target cell, wherein the sequence encodes or 
20 comprises a protein expressed in the target cell; 

identifying a population of peptide fragments of the protem, wherein members of 
tile population of peptide fragments have a known or predicted afBmty for a major 
histocompatibility complex class I receptor peptide binding cleft; 

selecting the epitope from the population of peptide fragments, wherem the 
25 epitope corresponds to a product of a proteasome active in the target cell. 

54. The method of claim 50 or 53, wherein the target cell is a neoplastic cell or an antigen 
presenting cell. 

55. A method of epitope discovery comprising the steps of: 

providing a neoplastic cell and a sequence, wherein the sequence comprises or 
30 encodes an antigen associated with the neoplastic cell; 

identijfying a population of peptide fragments of the antigen, wherein the 
population of peptide fragments is predicted to have an aflfmity for a major 
histocompatibility complex class I receptor peptide bindmg cleft; 



-86- 



wo 01/82963 PCTAJSOl/13806 

selecting an epitope from the population of peptide fragments, wherein the epitope 
is determine by in vitro analysis to be a proteasome cleavage reaction product of a 
proteasome active in the neoplastic cell. 

56. The method of claim 52, 54, or 55, wherem the neoplastic cell is selected from the 
5 group consisting of leukemia, carcinoma, lymphoma, astrocytoma, sarcoma, glioma, 

retinoblastoma, melanoma, Wilm's tumor, bladder cancer, breast cancer, colon cancer, 
hepatocellular cancer, pancreatic cancer, prostate cancer, lung cancer, liver cancer, stomach cancer, 
cervical cancer, testicular cancer, renal cell cancer, and brain cancer. 

57. The method of claim 52, 54, 55 or 56, wherein the antigen is selected from the group 
10 consistmg of MelanA (MART-I), gplOO (Pmel 17), tyrosinase, TRP-1, TRP-2, MAGE-1, MAGE- 

3, BAGE, GAGE-1, GAGE-2, CEA, RAGE, NY-ESO, SCP-1, Hom/Mel-40, FRAME, p53, H-Ras, 
HER-2/neu, BCR-ABL, E2A-PRL, H4-RET, IGHJGK, MYL-RAR, Epstein Barr virus antigens, 
EBNA, human papillomavmis (HPV) antigens E6 and E7, TSP-180, MAGE-4, MAGE-5, MAGE- 
6, pl85erbB2, plSOerbB-S, c-met, nm-23Hl, PSA, TAG.72-4, CAM 17.1, NuMa, K-ras, ^ 
15 Catenin, CDK4, Mum-1, and pl6. 

58. The method of claim 50 or 53, wherein the target cell is infected by an mtracellular 
parasite. 

59. The method of claim 52, wherein the pathogen is an intracellular parasite. 

60. The method of claim 58 or 59, wherein the intracellular parasite is a virus. 

20 61. The method of claim 60, wherein the vuus is selected from the group consisting of: 

adenovirus, cytomegalovmis, Epstein-Barr virus, herpes simplex virus 1, herpes simplex virus 2, 
human herpesvirus 6, varicella-zoster virus, hepatitis B virus, hepatitis D virus, papilloma virus, 
parvovirus B19, polyomavirus BK, polyomavmis JC, hepatitis C virus, measles virus, rubella 
virus, human immunodeficiency virus (HIV), human Tcell leukemia virus I, and human Tcell 

25 leukemia vmis 11, a rotaviruses, a reovirus, a picomavirus, a hepadnavirus, an adenovirus, and a 
papovavirus. 

62. The method of claim 60, wherein the antigen is selected from the group consisting of 
adenovirus, cytomegalovirus, Epstein-Barr virus, herpes simplex virus 1, herpes simplex virus 2, 
human herpesvirus 6, varicella-zoster virus, hepatitis B virus, hepatitis D virus, papilloma vuiis, 

30 parvovirus B19, polyomavirus BK, polyomavmis JC, hepatitis C virus, measles virus, rubella 
virus, human immunodeficiency virus (HIV), human Tcell leukemia vfrus I, and human Tcell 
leukemia virus 11. 

63. The method of claim 52, wherein the pathogen is selected from the group consistmg of 
a bacterium, a fungus, a protozoan, and a prion. 
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64. The method of claim 58, wherein the intracellular parasite is a bacterium, a protozoan, 
a fungus, or a prion. 

65. The method of claim 63 or 64, wherein the pathogen or intracellular parasite is 
selected from the group consisting of Chlamydia, Listeria, Salmonella, Legionella, Brucella, 

5 Coxiella, Rickettsia, Afycobacterium, Leishmania, Trypanasoma, Toxoplasma, and Plasmodium, 

66. The method of claim 58, wherein the antigen is a parasite-associated antigen. 

67. The method of claim 50, 53, or 55, wherein the pioteasome is a housekeeping 
proteasome. 

68. The method of claim 50, 53, or 55, wherein the proteasome is an immune proteasome. 
10 69. The me&od of claim 49 or 51, wherein the proteolytic cleavage comprises an activity 

of a protease in a lysosome of the host 

70. The method of claun 49 or 51, wherein the proteolytic cleavage comprises an activity 
of a protease in an endosome of the host 

71. The method of any of claims 49-70, wherein the afiBnity is determined by an in vitro 
15 binding assay. 

72. The method of any of claims 49-70, wherein the affinity is determined by an 
algorithm. 

73. The method of any of claims 49-72, wherein the product is determined by in vitro 
analysis. 

20 74- An epitope discovered according to the method of any of claims 49-73, 

75. The epitope of claim 74, having a sequence of SEQ IDNO: 78. 

76. An MHC class I-restricted epitope discovered by the process comprising the steps of: 

determining the type of proteasome expressed by a target cell, and 

identifying epitopes expressed by said target cell. 
25 77. The epitope of claim 76 wherein the determining step comprises a technique selected 

from the group consisting of RT-PCR, ELISA, western blotting, polyacrylamide gel 
electrophoresis, immunohistochemistiy, and cleavage of standard substrates with proteasonies 
purified from said target cell. 

78. The epitope of claim 76 wherein the determining step comprises comparmg untreated 
30 and y-IFN treated target cells. 

79. The epitope of claim 76 wherein the identifying step comprises a technique selected 
from the group consisting of peptide elution, in vitro proteolysis, mass spectrometry, HPLC, amino 
acid sequencing, and determination of immimoreactivity. 
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80. The epitope of claim 79 ^^erein the determination of immunoreactivity comprises an 
assay to detect a condition selected from the group consisting of cytotoxicity, proliferation, and 
c^okine production. 

81, A vaccine comprising the epitope of any of claims 74-80. 

5 82. An epitope cluster, the cluster being derived from an antigen associated with a target, 

the cluster comprising or encodmg at least two sequences havmg a known or predicted affinity for 
an MHC receptor peptide binding cleft, wherein the cluster is an incomplete fragment of the 
antigen. 

83. The epitope cluster of claim 82, wherein the target is a neoplastic cell. 
10 84. The epitope cluster of claim 82, wherem die target is a cell infected by an intracellular 

parasite. 

85. The epitope cluster of claim 84, wherein the intracellular parasite is a virus. 

86. The epitope cluster of claim 84, wherein the intracellular parasite is a bacterium or a 
protozoan. 

15 87. The epitope cluster of claim 82, wherem the target is a pathogenic agent. 

88. The epitope cluster of claim 87, wherein the pathogenic agent is selected fix)m the 
group consisting of a virus, a bacterium, a fungus, a protozoan, a prion, a toxin, a venom, and an 
allergen. 

89. The epitope cluster of claim 82, wherein the MHC receptor is a class I HLA receptor. 
20 90. The epitope cluster of claim 82, wherein the MHC receptor is a class 11 HLA receptor. 

91. The epitope cluster of claim 82, wherein the cluster comprises or encodes a 
polypeptide having a length, wherein the length is at least 10 amino acids. 

92. The epitope cluster of claim 91, wherein the length of the polypeptide is less than 
about 75 amino acids. 

25 93. The epitope cluster of claim 82, the antigen having a length, wherein the cluster 

consists of or encodes a polypeptide having a length, wherein the length of the polypeptide is less 
than about 80% of the length of the antigen. 

94. The epitope cluster of claim 93, the antigen having a length, wherein the cluster 
consists of or encodes a polypeptide having a length, wherein the length of the polypeptide is less 

30 than about 50% of the length of the antigen. 

95. The epitope cluster of claim 94, the antigen having a length, wherem the cluster 
consists of or encodes a polypeptide having a length, wherein the length of the polypeptide is less 
than about 20% of the length of the antigen. 

96. A method of epitope cluster discovery comprising the steps of; 
35 providing a sequence of an antigen associated widi a target; 
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scoring candidate peptides within the sequence, based on known or predicted 
ajBinity for an MHC receptor peptide binding cleft, to identify putative MHC epitopes; and 

identifying an epitope cluster within the antigen, wherein the cluster comprises at 
least two of the putative MHC epitopes, and wherein the cluster comprises a higher density 
5 of putative MHC epitopes than a density of putative MHC epitopes in the antigen as a 

whole. 

97. The method of claim 96, wherein the target is a neoplastic cell. 

98. The method of claim 96, wherein the target is a cell infected by an intracellular 
parasite. 

10 99- The melliod of claim 98, wherein the intracellular parasite is a virus. 

100. The method of claim 98, wherein the intracellular parasite is a bacterium or a 
protozoan. 

101 . The method of claim 96, wherein the target is a pathogenic agent 

102. The method of claim 96, wherein the MHC receptor binding cleft is a class I HLA 
15 receptor bmding cleft. 

103. The method of claim 96, wherein the MHC receptor binding cleft is a class U HLA 
receptor binding cleft. 

104. The method of claim 96, wherein the cluster comprises or encodes a polypeptide 
having a length, wherein the length is at least 1 0 amino acids. 

20 105. The method of claim 104, wherein the length of the polypeptide is less than about 

75 amino acids. 

106. The method of claim 96, the antigen having a length, wherein the cluster consists 
essentially of or encodes a polypeptide having a length, wherein the length of the polypeptide is 
less than about 80% of the length of the antigen. 
25 107. The method of claim 106, the antigen having a length, wherein the cluster consists 

essentially of or encodes a polypeptide having a length, wherein the length of the polypeptide is 
less than about 50% of the length of the antigen. 

108. The method of claim 107, the antigen having a length, wherein the cluster consists 
essentially of or encodes a polypeptide having a length, wherein the length of the polypeptide is 

30 less than about 20% of the length of the antigen. 

109. A nucleic acid construct comprising a first coding region, wherein the first coding 
region comprises a first sequence encodmg at least a first polypeptide, wherein the first 
polypeptide comprises a first housekeeping epitope derived fi-om a first antigen associated with a 
first target cell. 
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110. The nucleic acid construct of claim 109, wherein the first coding region further 
comprises a second sequence encoding at least a second polypeptide, wherein the second 
polypeptide comprises a second epitope derived from a second antigen associated with a second 
target cell. 

5 111. The nucleic acid construct of claim 110, wherein the first polypeptide and the 

second polypeptide are contiguous. 

112. The nucleic acid construct of claim 110, wherein the first polypeptide and the 
second polypeptide are not contiguous. 

113. The nucleic acid construct of claim 110, wherein the second epitope is a 
1 0 housekeeping epitope. 

1 14. The nucleic acid construct of claim 1 10, wherein the second epitope is an immune 

epitope. 

1 15. The nucleic acid construct of claim 1 10, wherein the fiirst antigen and the second 
antigen are the same. 

15 116, The nucleic acid construct of claim 1 10, wherem the first antigen and the second 

antigen are not the same. 

1 1 7. The nucleic acid construct of claim 110, wherein the first target cell and the second 
target cell are the same. 

1 18. The nucleic acid construct of claim 110, wherein the first target cell and the second 
20 tai;get cell are not the same. 

1 19. The nucleic acid construct of claim 109, wherein the first target cell is a neoplastic 

ceU. 

120. The nucleic acid construct of clahn 119, wherein the neoplastic cell is selected 
from the group consisting of leukemia, carcinoma, lymphoma, astrocytoma, sarcoma, glioma, 

25 retinoblastoma, melanoma, Wilm's tumor, bladder cancer, breast cancer, colon cancer, 
hepatocellular cancer, pancreatic cancer, prostate cancer, lung cancer, liver cancer, stomach cancer, 
cervical cancer, testicular cancer, renal cell cancer, and brain cancer. 

121 . The nucleic acid construct of claim 1 19, wherem the fu^t antigen is selected from 
the group consisting of MART-l/MelanA, gp 100 (Pmel 17), tyrosinase, TRP-1, TRP-2, MAGE-1, 

30 MAGE-3, BAGE, GAGE-1, GAGE-2, CEA, RAGE, SCP-1, HomMel-40, FRAME, p53, H-Ras, 
HER>2/neu, BCR-ABL, E2A-PRL, h4-RET, IGH-IGK, MYL-RAR, Epstein Barr virus antigens, 
EBNA, human papillomavirus (HPV) antigens E6 and E7, TSP-180, MAGE-4, MAGE-5, MAGE- 
6, pl85erbB2, pl80erbB-3, c-met, nm-23Hl, PSA, TAG-72^, CAM 17.1, NuMa, K-ras, b- 
Catenin, CDK4, Mum-1, pl6, pl5, NY-ESO, products of an SSX gene family member, CT-7, and 

35 products of an SCP gene family member. 
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122. The nucleic acid construct of claim 109, wherein tiie target cell is infected by an 
intracellular parasite. 

123. The nucleic acid construct of claim 122, wherein the intracellular parasite is a 

virus. 

5 124. The nucleic acid construct of claim 123, wherein the virus is selected from the 

group consisting of: adenovirus, cytomegalovirus, Epstein-Barr virus, herpes simplex virus 1 and 
2, human herpesvirus 6, varicella-zoster virus, hepatitis B virus, hepatitis D vnus, papillomavirus, 
parvovirus B19, polyomavirus BK, polyomavirus JC, hepatitis C virus, measles virus, rubella 
virus, human immunodeficiency virus (HTV), human T-cell leukemia virus I, and human T-cell 
10 leukemia virus n. 

125. The nucleic acid construct of claim 122, wherein the intracellular parasite is a 
bacterium, a protozoan, a fungus, or a prion, 

126. The nucleic acid construct of claim 125, wherein the intracellular parasite is 
selected from the group consisting of Chlamydia, Listeria, Salmonella, Legionella, Brucella, 

15 Coxiella, Rickettsia, Afycobacterium, Leishmania, Trypanosoma, Toxoplasma, and Plasmodium. 

127. The nucleic acid construct of Claim 109, further comprising a first promoter 
sequence operably linked to the first coding region. 

128. The nucleic acid construct of claim 127 wherein the promoter is derived from 
cytomegalovirus (CMV), SV40, or a retroviral long terminal repeat (LTR). 

20 129. The nucleic acid construct of Claim 127 wherein the promoter is a bidirectional 

promoter. 

130. The nucleic acid construct of claim 127, further comprising a second promoter 
sequence operably linked to a second coding region. 

131. The nucleic acid construct of claim 109 further comprising a poly-A sequence 
25 operably linked to the fu^t coding region. 

132. The nucleic acid construct of claim 1 10, further comprising an internal ribosome 
entry site (IRES) sequence. 

133. The nucleic acid construct of claim 132, wherein the IRES sequence is between the 
first and the second coding sequences. 

30 134. The nucleic acid construct of claim 1 1 0, further comprising a ubiquitin sequence. 

135. The nucleic acid construct of claim 134, wherein the ubiquitin sequence is between 
the first and the second coding sequences. 

136. The nucleic acid construct of claim 110, further comprising a sequence encoding 
an autocatalytic peptide. 
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137. The nucleic acid construct of claim 136, wherein the autocatalytic peptide- 
encoding sequence is between the first and tiie second coding sequences. 

138. The nucleic acid construct of claim 109, further comprising a sequence selected 
from the group consisting of enhancers, nuclear import sequences, immunostimulatory sequences, 

5 and expression cassettes for cytokines, selection markers, or reporter molecules. 

139. The nucleic acid construct of claim 109, wherein the fust polypeptide is about 7 to 
1 S amino acids in length. 

140. The nucleic acid construct of claim 109, wherein the first polypeptide is 9 or 10 
ammo acids in length. 

10 141. The nucleic acid construct of claim 110, wherein the second polypeptide is 9 or 10 

amino acids in length. 

142. The nucleic acid construct of claim 110, wherem the second polypeptide is an 
epitope cluster between about 10 and about 75 amino acids in length. 

143. The nucleic acid construct of claim 110, wherein the first epitope has a bmding 
15 affinity for a fust MHC allele, and wherein the second epitopes has a binding afifmity for a second ' 

MHC allele. 

144. The nucleic acid construct of claun 143, wherein the first allele and the second 
allele are the same. 

145. The nucleic acid construct of claim 143, wherein the first allele and the second 
20 allele are not the same. 

146. The nucleic acid construct of claim 109 further comprising at least a second coding 
region wherem the second coding region comprises a second sequence encoding at least a second 
polypeptide, wherein the second polypeptide comprises a second epitope derived fi-om a second 
antigen associated with a second target cell. 

25 147. The nucleic acid construct of claim 146 wherein said first and second coding 

regions are operably linked to either side of a single bidirectional promoter. 

148. The nucleic acid construct of claim 146 wherein at least one coding region 
comprises multiple sequences encoding multiple polypeptides, wherein each polypeptide 
comprises at least one epitope derived fi-om an antigen associated with a target cell. 

30 149. The nucleic acid construct of claim 129 wherein the biduectional promoter is 

further operably linked to a second sequence, said second sequence encoding an adjuvant. 

150. The nucleic acid construct of claim 136 wherein the encoded peptide possesses 
autoproteolytic activity. 

151. The nucleic acid construct of claim 136 wherein tilie encoded peptide inhibits 
35 peptide bond formation. 
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152. A vaccine comprising the nucleic acid construct of any of claims 109-151. 

153. A method of making a vaccine, comprising the step of: 

preparing a vaccine composition comprising the nucleic acid construct of any of 
claims 109-151. 

5 154. A vaccine made according to the method of claim 41 or 153, 

155. A metiiodoftreating an animal comprising: 

administering to an anhnal the vaccine any of clauns 1-40, 81, 152, or 154. 

156. The method of claim 155, wherein the administering step comprises a mode of 
delivery selected from die group consisting of transdermal, intranodal, perinodal, oral, intravenous, 

10 intradermal, mtramuscular, intr^eritoneal, and mucosal. 

157. The method of claim 155, further comprising a step of assaying to determine a 
characteristic indicative of a state of die target cells. 

158. The method of claim 157, comprising a first assaying step and a second assaying 
step, wherem the first assaying step precedes the administering step, and wherein die second 

15 assaying step follows the administering step. 

159. The method of claim 158, furdier comprising a step of comparing the characteristic 
determined in the first assaying step with the characteristic determined in the second assaymg step 
to obtain a result. 

160. The method of claim 159, wherein the result is selected from the group consisting 
20 of: a diminution in number of target cells, a loss of mass or size of a tumor comprising target cells, 

or a decrease in number or concentration of an intracellular parasite infecting target cells. 

161. The use of a housekeeping epitope in the manu&cture of a vaccine for use in the 
method of treatment in any claims 1 55- 1 60. 
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SEQUENCE LISTING 

4 

<110> CTt, IMMONOTHERAPIES CORP, 
SIMARD, John, J., L. 
DIAMOND, David, C. 
IiBl , Xiang-Dong 

<120> EPITOPE SYNCHRONIZATION IN ANTIGEN 
PRESENTING CEUjS 



<130> CTLIMM.004VPC 

<150> US 09/561,572 
<151> 2000-04-28 



<150> US 09/561,571 
<151> 2000-04-28 



<150> US 09/561,074 
<151> 2000-04-28 

<150> US 09/560,465 
<151> 2000-04-28 

<160> 82 



<:170> PastSEQ for Windows Version 4.0 

<210> 1 
<211> 118 
<212> PRT 

<213> Homo sapiens 



<400> 1 



Met 


Pro 


Arg Glu 


Asp Ala 


1 






5 


His 


Gly 


His Ser 


Tyr Thr 






20 




Leu 


Thr 


Val He 


Leu Gly 






35 


Arg 


Arg 


Arg Asn 


Gly Tyr 




50 




Gly 


Thr 


Gin Cys 


Ala Leu 


65 






70 


His 


Arg 


Asp Ser 


Lys Val 








85 


Val 


Pro 


Asn Ala 


Pro Pro 






100 




Pro 


Pro 


Pro Tyr 


Ser Pro 






115 





10 15 
Thr Ala Glu Glu Ala Ala Gly He Gly ll( 

25 30 
Val Leu Leu Leu He Gly Cys Trp Tyr Cy* 

40 45 
Arg Ala Leu Met Asp Lys Ser Leu His Va] 
55 60 
Thr Arg Arg Cys Pro Gin Glu Gly Phe Asi 
75 80 
Ser Leu Gin Glu Lys Asn Cys Glu Pro Va3 

90 95 
Ala Tyr Glu Lys Leu Ser Ala Glu Gin Sei 
105 110 



<210> 2 
<211> 188 
<212> PRT 

<213> Homo sapiens 



<400> 2 
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He Pro Glu Lys He Gin Lys Ala Phe Asp Asp He Ala Lys Tyr Phe 

20 25 30 

Ser Lys Glu Glu Trp Glu Lys Met Lys Ala Ser Glu Lys He Phe Tyr 

35 40 45 

Val Tyr Met Lys Arg Lys Tyr Glu Ala Met Thr Lys Leu Gly Phe Lys 

50 55 60 

Ala Thr Leu Pro Pro Phe Met Cys Asn Lys Arg Ala Glu Asp Phe Gin 
" 70 75 80 

Gly Asn Asp Leu Asp Asn Asp Pro Asn Arg Gly Asn Gin Val Glu Arg 

85 90 95 

Pro Gin Met Thr Phe Gly Arg Leu Gin Gly He Ser Pro Lys He Met 

100 105 110 

Pro Lys Lys Pro Ala Glu Glu Gly Asn Asp Ser Glu Glu Val Pro Glu 

115 120 125 

Ala Ser Gly Pro Gin Asn Asp Gly Lys Glu Leu Cys Pro Pro Glv Lvs 

130 135 140 y y 

Pro Thr Thr ser Glu Lys He His Glu Arg Ser Gly Pro Lys Arg Gly 

150 155 160 

Glu Hxs Ala Trp Thr His Arg Leu Arg Glu Arg Lys Gin Leu Val He 

165 170 175 

Tyr Glu Glu He Ser Asp Pro Glu Glu Asp Asp Glu 
180 185 



<210> 3 

<211> 180 

<212> PRT 

<213> Homo sapiens 

<400> 3 

Met Gin Ala Glu Gly Arg Gly Thr Gly Gly Ser Thr Gly Asp Ala Asp 

15 10 15 

Gly Pro Gly Gly Pro Gly He Pro Asp Gly Pro Gly Gly Asn Ala Gly 

20 25 30 

Gly Pro Gly Glu Ala Gly Ala Thr Gly Gly Arg Gly Pro Arg Gly Ala 

35 40 45 

Gly Ala Ala Arg Ala Ser Gly Pro Gly Gly Gly Ala Pro Arg Gly Pro 

50 55 60 

His Gly Gly Ala Ala Ser Gly Leu Asn Gly Cys Cys Arg Cys Gly Ala 
^5 70 75 80 

Arg Gly Pro Glu Ser Arg Leu Leu Glu Phe Tyr Leu Ala Met Pro Phe 

85 90 95 

Ala Thr Pro Met Glu Ala Glu Leu Ala Arg Arg Ser Leu Ala Gin Asp 

100 105 110 

Ala Pro Pro Leu Pro Val Pro Gly Val Leu Leu Lys Glu Phe Thr Val 

115 120 125 

Ser Gly Asn He Leu Thr He Arg Leu Thr Ala Ala Asp His Arg Gin 

130 135 140 

Leu Gin Leu Ser He Ser Ser Cys Leu Gin Gin Leu Ser Leu Leu Met 

150 155 160 

Trp He Thr Gin Cys Phe Leu Pro Val Phe Leu Ala Gin Pro Pro Ser 
165 170 175 

Gly Gin Arg Arg 
180 



<210> 4 
<211> 529 
<212> PRT 

<213> Homo sapiens 
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<400> 4 

Met Leu Leu Ala Val Leu Tyr Cys Leu Leu Tip Ser Phe Qln Thr Ser 

^5 10 15 

Ala Gly His Phe Pro Arg Ala Cys Val Ser Ser Lys Asn Leu Met Glu 

20 25 30 

Lys Glu Cys Cys Pro Pro Trp Ser Gly Asp Arg Ser Pro Cys Gly Gin 

35 40 45 

Leu Ser Gly Arg Gly Ser Cys Gin Asn lie Leu Leu Ser Asn Ala Pro 
. 5° 55 60 

Leu Gly Pro Gin Phe Pro Phe Thr Gly Val Asp Asp Arg Glu Ser Trp 

■'O 75 80 

Pro Ser Val Phe Tyr Asn Arg Thr Cys Gin Cys Ser Gly Asn Phe Met 

85 90 95 

Gly Phe Ash Cys Gly Asn Cys Lys Phe Gly Phe Trp Gly Pro Asn Cys 

100 105 110 

Thr Glu Arg Arg Leu Leu Val Arg Arg Asn lie Phe Asp Leu Ser Ala 

120 125 
Pro Glu Lys Asp Lys Phe Phe Ala Tyr Leu Thr Leu Ala Lys His Thr 
130 135 

lie Ser Ser Asp Tyr- Val lie Pro lie Gly Thr Tyr Gly Gin Met Lvs 

150 155 160 

Asn Gly Ser Thr Pro Met Phe Asn Asp He Asn He Tyr Asp Leu Phe 

1^5 170 
Val Trp Met His Tyr Tyr Val Ser Met Asp Ala Leu Leu Gly Gly Ser 

180 185 190 

Glu He Trp Arg Asp He Asp Phe Ala His Glu Ala Pro Ala Phe Leu 

195 200 205 

Pro Trp His Arg Leu Phe Leu Leu Arg Trp Glu Gin Glu He Gin Lvs 

210 215 220 

Leu Thr Gly Asp Glu Asn Phe Thr He Pro Tyr Trp Asp Trp Arg Asp 
^25 230 235 240 

Ala Glu Lye Cys Asp He Cys Thr Asp Glu Tyr Met Gly Gly Gin His 
r. , 250 255 

Pro Thr Asn Pro Asn Leu Leu Ser Pro Ala Ser Phe Phe Ser Ser Trp 

260 265 270 

Gin He Val Cys Ser Arg Leu Glu Glu Tyr Asn Ser His Gin Ser Leu 

275 280 285 

Cys Asn Gly Thr Pro Glu Gly Pro Leu Arg Arg Asn Pro Gly Asn His 

290 295 300 

Asp Lys Ser Arg Thr Pro Arg Leu Pro Ser Ser Ala Asp Val Olu Phe 
i?^ 310 315 320 

Cys Leu Ser Leu Thr Gin Tyr Glu Ser Gly Ser Met Asp Lys Ala Ala 

325 330 
Asn Phe Ser Phe Arg Asn Thr Leu Glu Gly Phe Ala Ser Pro Leu Thr 

340 345 
Gly He Ala Asp Ala Ser Gin Ser Ser Met His Asn Ala Leu His He 

355 360 365 

Tyr Met Asn Gly Thr Met Ser Gin Val Gin Gly Ser Ala Asn Asp Pro 

370 375 380 

He Phe Leu Leu His His Ala Phe Val Asp Ser He Phe Glu Gin Trp 

390 395 400 

Leu Arg Arg His Arg Pro Leu Gin Glu Val Tyr Pro Glu Ala Asn Ala 

405 410 
Pro He Gly His Asn Arg Glu Ser Tyr Met Val Pro Phe He Pro Leu 

420 425 430 

Tyr Arg Asn Gly Asp Phe Phe He Ser Ser Lys Asp Leu Gly Tyr Asp 

435 440 
Tyr Ser Tyr Leu Qln Asp Ser Asp Pro Asp ser Phe Gin Asp Tvr' He 

450 455 460 

Lys Ser Tyr Leu Glu Gin Ala Ser Arg He Trp Ser Trp Leu Leu Glv 

470 475 480 
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Ala Ala Met 



Val Gly Ala Val Leu Thr Ala Leu Leu Ala Gly Leu Val 

495 

Cys Arg His Lys Arg Lys Gin Leu Pro Glu Glu Lys Gin 
500 505 510 

Met Glu Lys Glu Asp Tyr His Ser Leu Tyr Gin Ser His 
520 525 



Ser Leu Leu 



Pro Leu Leu 
515 

Leu 



<210> 5 
<211> 228 
<212> DNA 

<213> Saccharomyces Cerevisiae 

<400> 5 

atgcagattt tcgtcaagac tttgaccggt aaaaccataa cattggaagt tgaatcttcc 60 
gataccatcg acaacgttaa gtcgaaaatt caagacaagg aaggtatccc tccagatcaa 120 
caaagattga tctttgccgg taagcagcta gaagacggta gaacgctgtc tgattacaac 180 
attcagaagg agtccacctt acatcttgtg ctaaggctaa gaggtggc 228 

<210> 6 
<211> 581 
<212> DNA 

<213> Encephalomyocarditis Virus 



aattccgccc ctctccctcc ccccccccta acgttactgg ccgaagccgc ttggaataag 60 
gccggtgtgc gtttgtctat atgtgatttt ccaccatatt gccgtctttt ggcaatgtga 120 
gggcccggaa acctggccct gtcttcttga cgagcattcc taggggtctt tcccctctcg 180 
ccaaaggaat gcaaggtctg ttgaatgtcg tgaaggaagc agttcctctg gaagcttctt 240 
gaagacaaac aacgtctgta gcgacccttt gcaggcagcg gaacccccca cctggcgaca 300 
ggtgcctctg cggccaaaag ccacgtgtat aagatacacc tgcaaaggcg gcacaacccc 360 
agtgccacgt tgtgagttgg atagttgtgg aaagagtcaa atggctctcc tcaagcgtat 420 
tcaacaaggg gctgaaggat gcccagaagg taccccattg tatgggatct gatctggggc 480 
ctcggtgcac atgctttaca tgtgtttagt cgaggttaaa aaaacgtcta ggccccccga 540 
accacgggga cgtggttttc ctttgaaaaa cacgatgata a sBl 

<210> 7 
<211> 16 
<212> PRT 

<213> Homo sapiens 



Leu Leu Ala Val Leu Tyr Cys Leu Leu Trp Ser Phe Gin Thr Ser 



<210> 8 

<211> 22 

<212> PRT 

<213> Homo sapiens 



<400> 6 



<400> 8 




<210> 9 
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<211> 24 
<212> PRT 

<213> Homo sapiens 



<400> 9 

Qln Ala Ala Ser Ser Ser Ser Thr 

1 5 
Val Pro Ala Ala Glu Ser Pro Asp 
20 



Leu Val Glu Val Thr Leu Gly Glu 
10 15 



<210> 10 
<211> 24 
<212> PRT 

<213> Homo sapiens 



<400> 10 

Glu Phe Leu Trp Gly Pro Arg Ala 

1 5 
Val Leu His His Met Val Lys He 
20 



Leu Val Glu Thr Ser Tyr Val Lys 
10 15 



<210> 11 
<211> 19 
<212> PRT 

<213> Homo sapiens 



<400> 11 

Ala Pro Glu Glu Lys He Trp Glu 

1 5 
Glu Gly Arg 



Glu Leu Ser Val Leu Glu Val Phe 
10 15 



<210> 12 
<211> 16 
<212> PRT 

<213> Homo sapiens 
<400> 12 



Glu Leu Met Glu Val Asp Pro He Gly His Leu Tyr He Phe Ala Thr 
^5 10 15 



<210> 13 
<211> 15 
<212> PRT 

<213> Homo sapiens 



<400> 13 

Phe Leu Trp Gly Pro Arg Ala Leu Val Glu Thr Ser Tyr Val Lys 
15 10 15 



<210> 14 
<211> 16 
<212> PRT 
<213> Homo 

<400> 14 



sapiens 
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Met Leu Leu Ala Val Leu Tyr Cys Leu Leu Trp Ser Phe Gin Thr Ser 
15 10 15 



<210> 15 

<211> 22 

<212> PRT 

<213> Homo sapiens 



<400> 15 

His ser Tyr Thr Thr Ala Qlu Glu Ala Ala Gly He Thr He Leu Thr 

3-5 10 15 

Val He Leu Gly Val Leu 
20 



<210> 16 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 16 

Glu Ala Ala Ser Ser Ser Ser Thr Leu Val Glu Val Thr Leu Gly Glu 

15 10 15 

Val Pro Ala Ala Glu Ser Pro Asp 
20 



<210> 17 

<211> 24 

<212> PRT 

<213> Homo sapiens 

<400> 17 

Glu Phe Leu Trp Gly Pro Arg Ala Leu Val Glu Thr Ser Tyr Val Lys 

1 5 10 15 

Val Leu' His His Met Val Lys He 
20 



<210> 18 
<211> 19 
<212> PRT 
<213> Homo 



sapiens 



<400> 18 

Ala Pro Glu Glu Lys He Trp Glu Glu Leu Ser Val Leu Glu Val Phe 

15 10 15 

Glu Gly Arg 



<210> 19 

<211> 16 

<212> PRT 

<213> Homo sapiens 

<400> 19 

Glu Leu Met Glu Val Asp Pro He Gly His Leu Tyr He Phe Ala Thr 
15 10 15 
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<210> 20 

<211> 15 

<212> PRT 

<213> Homo sapiens 



<400> 20 

Phe Leu Trp Gly Pro Arg Ala Leu Val Glu Thr Ser Tyr Val Lys 
1 5 10 



<210> 21 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 21 

Leu Oln Leu Ser. Asn Gly Asn Arg Thr Leu 
15 10 



<210> 22 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 22 

Leu Gin Leu Ser Asn Asp Asn Arg Thr Leu 
15 10 



<210> 23 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 23 

Leu Gin Leu Ser Asn Gly Asn Arg Thr Leu 
15 10 



<210> 24 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 24 

Leu Leu Val His Asn Leu Pro Gin His Leu 
15 10 



<210> 25 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 25 

Leu Ser Asn Asp Asn Arg Thr Leu Thr Leu 
15 10 



15 
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<210> 26 
<211> 10 
<212> PRT 

<2X3> Homo sapiens 
<400> 26 

Leu Ser Asa Gly Asn Arg Thr Leu Thr Leu 
15 10 



<210> 27 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400>. 27 

Leu Ser Asn Gly Asn Arg Thr Leu Thr Leu 
1 5 10 



<210> 28 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 28 

Cys Gin Ala His Asn Ser Asp Thr Gly Leu 
15 10 



<210> 29 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 29 

lie He Tyr Pro Asn Ala Ser Leu Leu He 
1 5 10 



<210> 30 
<211> 10 
<212> PRT 

<213> Homo sapiens 

<400> 30 

Ser Ala Asn Arg Ser Asp Pro Val Thr Leu 
15 10 



<210> 31 
<211> 10 
<:212> PRT 

<213> Homo sapiens 

<400> 31 

Ser Val Ser Ala Asn Arg Ser Asp Pro Val 
15 10 



<210> 32 
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<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 32 

Ser lie Ser Ser Asn Asn Ser Lys Pro Val 
^5 10 



<210> 33 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 33 

Trp Trp Val Asn Asn Qln Ser Leu Pro Val 
15 10 



<210> 34 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 34 

Trp Trp Val Asn Asn Gin ser Leu Pro Val 
^5 10 



<210> 35 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 35 

Ser He Ser Ser Asn Asn Ser Lys Pro Val 
1 5 10 



<210> 36 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 36 

Ser Asp Leu Val Asn Glu Glu Ala Thr Gly 
^5 10 



<210> 37 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 37 

He Thr Pro Asn Asn Asn Gly Thr Tyr Ala 
15 10 



<aiO> 38 
<211> 10 
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<212> PRT 

<213> Homo sapiens 



<400> 38 

Ser Leu Pro Val Ser Pro Arg Leu Gin Leu 
15 10 



<210> 39 
<211> 10 
<212> PRT 
<213> Homo 



sapiens 



<400> 39 

Ser Leu Pro Val Ser Pro Arg Leu Gin Leu 
1 5 10 



<210> 40 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 40 

Phe lie Thr Ser Asn Asn Ser Asn Pro Val 
15 10 

<210> 41 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 41 

Arg Ser Gly Glu Asn Leu Asn Leu Ser Cys 
15 10 



<210> 42 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 42 

Ser Leu Pro Val Ser Pro Arg Leu Gin Leu 
15 10 



<210> 43 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 43 

Val Tyr Pro Glu Leu Pro Lys Pro Ser lie 
15 10 



<210> 44 
<211> 10 
<212> PRT 
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<213> Homo sapiens 
<400> 44 

Ser Thr Pro Phe Asn Val Ala Glu Gly Lys 
15 10 



<210> 45 

<21X> 10 

<212> PRT 

<213> Homo sapiens 

<400> 45 

Asn Pro Pro Thr Thr Ala Lys lieu Thr lie 
15 10 



<210> 46 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 46 

Cys Phe Val Ser Asn Leu Ala Thr Gly Arg 
1 5 .10 



<210> 47 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 47 

Gly Ala Asn Leu Asn Leu Ser Cys His Ser 
1 5 10 



<210> 48 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 48 

Leu Ser Gly Ala Asn Leu Asn Leu Ser Cys 
15 10 



<210> 49 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 49 

Ser Tyr Leu Ser Gly Ala Asn Leu Asn Leu 

15 10 



<210> 50 
<211> 10 
<212> PRT 

<213> Homo sapiens 
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<400> 50 

Cys Gly He Gin Asn Ser Val Ser Ala Asn 
15 10 



<210> 51 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 51 

Val Ser Ala Glu Leu Pro Lys Pro Ser He 
1 5 10 



<210> 52 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 52 

Ser Asn He Thr Glu Lys Asn Ser Gly Leu 
15 10 



<210> 53 
<211> 10 
<212> PRT 
<213> Homo 



sapiens 



<400> 53 

Gly Asn He Gin Gin His Thr Gin Glu Leu 
15 10 



<210> 54 

<211> 10 

<212> PRT 

<213> Homo sapiens 



<400> 54 

Ser Asn Pro Pro Ala Gin Tyr Ser Trp Leu 
15 10 



<210> 55 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 55 

Cys Gly He Gin Asn Glu Leu Ser Val Asp 
15 10 



<210> 56 

<211> 10 

<212> PRT ■ 

<213> Homo sapiens 
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<400> 56 

He Ser Pro Leu Asn Thr Ser Tyr Arg Ser 
15 10 



<210> 57 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 57 

Pro Gly Pro Ala Tyr Ser Gly Arg Glu He 
15 10 



<210> 58 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 58 

Phe Asn Val Ala Glu Gly Lys Glu Val Leu 
1 5 10 



<210> 59 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 59 

lie Gly Val Leu Val Gly Val Ala Leu He 
15 10 



<210> 60 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 60 

Ser Ala Gly Ala Thr Val Gly He Met He 
1 5 10 



<210> 61 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 61 

Ser Lys Pro Val Glu Asp Lys Asp Ala Val 
15 10 



<210> 62 

<211> 10 

<212> PRT 

<213> Homo sapiens 

<400> 62 
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Ser Ala Ser Gly His Ser Arg Thr Thr Val 
15 10 



<210> 63 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 63 

Arg Pro Gly Val Asn Leu Ser I,eu Ser Cys 
15 10 



<210> 64 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 64 

Leu Ser Val Asp His Ser Asp Pro Val lie 
15 10 



<210> 65 
<211> 10 
<212> PRT 

<213> Homo sapiens 

<400> 65 

Ser Asn Asp Asn Arg Thr Leu Thr Leu Leu 
15 10 



<210> 66 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 66 

Ser Asn Pro Val Glu Asp Glu Asp Ala Val 
15 10 



<210> 67 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 67 

Ser Asn Asn Ser Asn Pro Val Glu Asp Glu 
15 10 



<210> 68 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 68 

Gly Leu Asn Arg Thr Thr Val Thr Thr lie 
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10 



<210> 69 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 69 

He Thr Val Asn Asu Ser Gly Ser Tyr Thr 
1 5 10 



<210> 70 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 70 

Ser Thr Gin Glu Leu Phe He Pro Asn He 
1 5 10 



<210> 71 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 71 

Phe Gin Gin Ser Thr Gin Glu Leu Phe He 
15 10 



<210> 72 
<211> 10 
<212> PRT 

<213> Homo sapiens 

<400> 72 

Ser Ala Arg Arg Ser Asp Ser Val He Leu 
15 10 



<210> 73 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 73 

Val Ser Ala Arg Arg Ser Asp Ser Val He 
15 10 



<210> 74 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 74 

Ser Lys Pro Val Glu Asp Lys Asp Ala Val 
15 10 
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<210> 75 
<211> 10 
<212> PRT 

<213> Homo sapiens 

<400> 75 

Thr Gly Gin Phe Arg Val Tyr Pro Glu Leu 
15 10 

<210> 76 
<211> 10 
<212> PRT 

<213> Homo sapiens 

<400> 76 

Lys Glu Val Leu Leu Leu Val His Asn Leu 
15 10 



<210> 77 
<211> 10 

<212> PRT 

<213> Homo sapiens 
<400> 77 

Ala Lys Leu 1^ lie Glu Ser Thr Pro Phe 
1 5 10 



<210> 78 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 78 

Phe Leu Pro Trp His Arg Leu Phe Leu Leu 
15 10 



<210> 79 
<211> 10 
<212> PRT 

<213> Homo sapiens 
<400> 79 

Phe Leu Pro Ser Asp Tyr Phe Pro Ser Val 
15 10 



<210> 80 
<211> 9 
<212> PRT 

<213> Homo sapiens 
<400> 80 

Ala Glu Met Gly Lys Tyr Ser Phe Tyr 
1 5 
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<210> 81 
<211> 72 
<212> DNA 
<213> SV40 

<220> 

<223> Synthetic Nucleotide Sequence 
<400> 81 

tggttgctga ctaattgaga tgcatgcttt gcatacttct gcctgctggg gagcctgggg 60 
actttccaca cc 72 

<210> 82 
<211> 6 
<212> DNAl 

<213> Artificial Sequence 
<220> 

<223> Synthetic Nucleotide Sequence 
<400> 82 

aacgtt 6 
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